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(54) Title: COMPOSITIONS AND METHODS FOR THE THERAPY AND DIAGNOSIS OF LUNG CANCER 

£J (57) Abstract: Compositions and methods for the therapy and diagnosis of cancer, particularly lung cancer, are disclosed. Illustra- 
tive compositions comprise one or more lung tumor polypeptides, immunogenic portions thereof, polynucleotides that encode such 

Q polypeptides, antigen presenting cell that expresses such polypeptides, and T cells that are specific for cells expressing such polypep- 
tides. The disclosed compositions are useful, for example, in the diagnosis, prevention and/or treatment of diseases, particularly lung 
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COMPOSITIONS AND METHODS FOR THE THERAPY AND 
DIAGNOSIS OF LUNG CANCER 

TECHNICAL FIELD OF THE INVENTION 

The present invention relates generally to therapy and diagnosis of 
5 cancer, such as lung cancer. The invention is more specifically related to polypeptides, 
comprising at least a portion of a lung tumor protein, and to polynucleotides encoding 
such polypeptides. Such polypeptides and polynucleotides are useful in pharmaceutical 
compositions, e.g., vaccines, and other compositions for the diagnosis and treatment of 
lung cancer. 

1 0 BACKGROUND OF THE INVENTION 

Field of the Invention 

Cancer is a significant health problem throughout the world. Although 

advances have been made in detection and therapy of cancer, no vaccine or other 

universally successful method for prevention and/or treatment is currently available. 
15 ■ Current therapies, which are generally based on a combination of chemotherapy or 

surgery and radiation, continue to prove inadequate in many patients. 

Description of Related Art 

Lung cancer is the primary cause of cancer death among both men and 

women in the U.S., with an estimated 172,000 new cases being reported in 1994. The 
20 five-year survival rate among all lung cancer patients, regardless of the stage of disease 

at diagnosis, is only 13%. This contrasts with a five-year survival rate of 46% among 

cases detected while the disease is still localized. However, only 16% of lung cancers 

are discovered before the disease has spread. 

In spite of considerable research into therapies for these and other 
25 cancers, lung cancer remains difficult to diagnose and treat effectively. Accordingly, 

there is a need in the art for improved methods for detecting and treating such cancers. 

The present invention fulfills these needs and further provides other related advantages. 
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SUMMARY OF THE INVENTION 

In one aspect, the present invention provides polynucleotide 
compositions comprising a sequence selected from the group consisting of: 

(a) sequences provided in SEQ ID NO:l-3, 6-8, 10-13, 15-27, 29, 30, 
5 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80-82, 84, 86-96, 107-109, 
111, 113, 125, 127, 128, 129, 131-133, 142, 144, 148-151, 153, 154, 157, 158, 160, 
167, 168, 171, 179, 182, 184-186, 188-191, 193, 194, 198-207, 209, 210, 213, 214, 217, 
220-224, 253-337, 345, 347, 349, 358, 362, 364, 365, 368, 370-375, 420, 424, 428, 431, 
434, 442, 447, 450 and 467; 

10 (b) complements of the sequences provided in SEQ ID NO: 1-3, 6-8, 

10-13, 15-27, 29, 30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80-82, 
84, 86-96, 107-109, 111, 113, 125, 127, 128, 129, 131-133, 142, 144, 148-151, 153, 
154, 157, 158, 160, 167, 168, 171, 179, 182, 184-186, 188-191, 193, 194, 198-207, 209, 
210, 213, 214, 217, 220-224, 253-337, 345, 347, 349, 358, 362, 364, 365, 368, 370-375, 

15 420, 424, 428, 43 1 , 434, 442, 447, 450 and 467; 

(c) sequences consisting of at least 10, 15, 20, 25, 30, 35, 40, 45, 50, 
75 and 100 contiguous residues of a sequence provided in SEQ ID NO:l-3, 6-8, 10-13, 
15-27, 29, 30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80-82, 84, 86- 
96, 107-109, 111, 113, 125, 127, 128, 129, 131-133, 142, 144, 148-151, 153, 154, 157, 

20 158, 160, 167, 168, 171, 179, 182, 184-186, 188-191, 193, 194, 198-207, 209, 210, 213, 
214, 217, 220-224, 253-337, 345, 347, 349, 358, 362, 364, 365, 368, 370-375, 420, 424, 
428, 431, 434, 442, 447, 450 and 467; 

(d) sequences that hybridize to a sequence provided in SEQ ID 
NO:l-3, 6-8, 10-13, 15-27, 29, 30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 

25 77, 78, 80-82, 84, 86-96, 107-109, 111, 113, 125, 127, 128, 129, 131-133, 142, 144, 
148-151, 153, 154, 157, 158, 160, 167, 168, 171, 179, 182, 184-186, 188-191, 193, 194, 
198-207, 209, 210, 213, 214, 217, 220-224, 253-337, 345, 347, 349, 358, 362, 364, 365, 
368, 370-375, 420, 424, 428, 431, 434, 442, 447, 450 and 467, under moderate or 
highly stringent conditions; 

30 (e) sequences having at least 75%, 80%, 85%, 90%, 95%, 96%, 

97%, 98% or 99% identity to a sequence of SEQ ID NO:l-3, 6-8, 10-13, 15-27, 29, 30, 
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32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80-82, 84, 86-96, 107-109, 
111, 113, 125, 127, 128, 129, 131-133, 142, 144, 148-151, 153, 154, 157, 158, 160, 
167, 168, 171, 179, 182, 184-186, 188-191, 193, 194, 198-207, 209, 210, 213, 214, 217, 
220-224, 253-337, 345, 347, 349, 358, 362, 364, 365, 368, 370-375, 420, 424, 428, 431, 
5 434, 442, 447, 450 and 467; and 

(f) degenerate variants of a sequence provided in SEQ ID NO: 1-3, 6- 
8, 10-13, 15-27, 29, 30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80- 
82, 84, 86-96, 107-109, 111, 113, 125, 127, 128, 129, 131-133, 142, 144, 148-151, 153, 
154, 157, 158, 160, 167, 168, 171, 179, 182, 184-186, 188-191, 193, 194, 198-207, 209, 

10 210, 213, 214, 217, 220-224, 253-337, 345, 347, 349, 358, 362, 364, 365, 368, 370-375, 
420, 424, 428, 431, 434, 442, 447, 450 and 467. 

In one preferred embodiment, the polynucleotide compositions of the 
invention are expressed in at least about 20%, more preferably in at least about 30%, 
and most preferably in at least about 50% of lung tumors samples tested, at a level that 

15 is at least about 2-fold, preferably at least about 5-fold, and most preferably at least 
about 10-fold higher than that for normal tissues. 

The present invention, in another aspect, provides polypeptide 
compositions comprising an amino acid sequence that is encoded by a polynucleotide 
sequence described above. 

20 The present invention further provides polypeptide compositions 

comprising an amino acid sequence selected from the group consisting of sequences 
recited in SEQ IDNO:152, 155, 156, 165, 166, 169, 170, 172, 174, 176, 226-252, 338- 
344, 346, 350, 357, 361, 363, 365, 367, 369, 376-382, 387-419, 423, 427, 430, 433, 
441, 443, 446, 449, 451-466 and 468-469. 

25 In certain preferred embodiments, the polypeptides and/or 

polynucleotides of the present invention are immunogenic, i.e., they are capable of 
eliciting an immune response, particularly a humoral and/or cellular immune response, 
as further described herein. 

The present invention further provides fragments, variants and/or 

30 derivatives of the disclosed polypeptide and/or polynucleotide sequences, wherein the 
fragments, variants and/or derivatives preferably have a level of immunogenic activity 
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of at least about 50%, preferably at least about 70% and more preferably at least about 
90% of the level of immunogenic activity of a polypeptide sequence set forth in SEQ ID 
NO.152, 155, 156, 165, 166, 169, 170, 172, 174, 176, 226-252, 338-344, 346, 350, 357, 
361, 363, 365, 367, 369, 376-382, 387-419, 423, 427, 430, 433, 441, 443, 446, 449 and 
5 451-466, or a polypeptide sequence encoded by a polynucleotide sequence set forth in 
SEQ ID NO: 1-3, 6-8, 10-13, 15-27, 29, 30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 
73, 74, 77, 78, 80-82, 84, 86-96, 107-109, 111, 113, 125, 127, 128, 129, 131-133, 142, 
144, 148-151, 153, 154, 157, 158, 160, 167, 168, 171, 179, 182, 184-186, 188-191, 193, 
194, 198-207, 209, 210, 213, 214, 217, 220-224, 253-337, 345, 347, 349, 358, 362, 364, 
10 365, 368, 370-375, 420, 424, 428, 43 1, 434, 442, 447, 450 and 467. 

The present invention further provides polynucleotides that encode a 
polypeptide described above, expression vectors comprising such polynucleotides and 
host cells transformed or transfected with such expression vectors. 

Within other aspects, the present invention provides pharmaceutical 
15 compositions comprising a polypeptide or polynucleotide as described above and a 
physiologically acceptable carrier. 

Within a related aspect of the present invention, the pharmaceutical 
compositions, e.g., vaccine compositions, are provided for prophylactic or therapeutic 
applications. Such compositions generally comprise an immunogenic polypeptide or 
20 polynucleotide of the invention and an immuno stimulant, such as an adjuvant. 

The present invention further provides pharmaceutical compositions that 
comprise: (a) an antibody or antigen-binding fragment thereof that specifically binds to 
a polypeptide of the present invention, or a fragment thereof; and (b) a physiologically 
acceptable carrier. 

25 Within further aspects, the present invention provides pharmaceutical 

compositions comprising: (a) an antigen presenting cell that expresses a polypeptide as 
described above and (b) a pharmaceutically acceptable carrier or excipient. Illustrative 
antigen presenting cells include dendritic cells, macrophages, monocytes, fibroblasts 
andB cells. 



4 



WO 02/47534 



PCT7US01/47576 



Within related aspects, pharmaceutical compositions are provided that 
comprise: (a) an antigen presenting cell that expresses a polypeptide as described above 
and (b) an immuno stimulant. 

The present invention further provides, in other aspects, fusion proteins 
5 that comprise at least one polypeptide as described above, as well as polynucleotides 
encoding such fusion proteins, typically in the form of pharmaceutical compositions, 
e.g., vaccine compositions, comprising a physiologically acceptable carrier and/or an 
immunostimulant. The fusions proteins may comprise multiple immunogenic 
polypeptides or portions/variants thereof, as described herein, and may further comprise 
10 one or more polypeptide segments for facilitating the expression, purification and/or 
immunogenicity of the polypeptide(s). 

Within further aspects, the present invention provides methods for 
stimulating an immune response in a patient, preferably a T cell response in a human 
patient, comprising administering a pharmaceutical composition described herein. The 
15 patient may be afflicted with lung cancer, in which case the methods provide treatment 
for the disease, or patient considered at risk for such a disease may be treated 
prophylactically. 

Within further aspects, the present invention provides methods for 
inhibiting the development of a cancer in a patient, comprising administering to a 
20 patient a pharmaceutical composition as recited above. The patient may be afflicted 
with lung cancer, in which case the methods provide treatment for the disease, or patient 
considered at risk for such a disease may be treated prophylactically. 

The present invention further provides, within other aspects, methods for 
removing tumor cells from a biological sample, comprising contacting a biological 
25 sample with T cells that specifically react with a polypeptide of the present invention, 
wherein the step of contacting is performed under conditions and for a time sufficient to 
permit the removal of cells expressing the protein from the sample. 

Within related aspects, methods are provided for inhibiting the 
development of a cancer in a patient, comprising administering to a patient a biological 
30 sample treated as described above. 
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Methods are further provided, within other aspects, for stimulating 
and/or expanding T cells specific for a polypeptide of the present invention, comprising 
contacting T cells with one or more of: (i) a polypeptide as described above; (ii) a 
polynucleotide encoding such a polypeptide; and/or (iii) an antigen presenting cell that 
5 expresses such a polypeptide; under conditions and for a time sufficient to permit the 
stimulation and/or expansion of T cells. Isolated T cell populations comprising T cells 
prepared as described above are also provided. 

Within further aspects, the present invention provides methods for 
inhibiting the development of a cancer in a patient, comprising administering to a 
10 patient an effective amount of a T cell population as described above. 

The present invention further provides methods for inhibiting the 
development of a cancer in a patient, comprising the steps of: (a) incubating CD4+ 
and/or CD8 + T cells isolated from a patient with one or more of: (i) a polypeptide 
comprising at least an immunogenic portion of polypeptide disclosed herein; (ii) a 
15 polynucleotide encoding such a polypeptide; and (iii) an antigen-presenting cell that 
expressed such a polypeptide; and (b) administering to the patient an effective amount 
of the proliferated T cells, and thereby inhibiting the development of a cancer in the 
patient. Proliferated cells may, but need not, be cloned prior to administration to the 
patient. 

20 Within further aspects, the present invention provides methods for 

determining the presence or absence of a cancer, preferably a lung cancer, in a patient 
comprising: (a) contacting a biological sample obtained from a patient with a binding 
agent that binds to a polypeptide as recited above; (b) detecting in the sample an amount 
of polypeptide that binds to the binding agent; and (c) comparing the amount of 

25 polypeptide with a predetermined cut-off value, and therefrom determining the presence 
or absence of a cancer in the patient. Within preferred embodiments, the binding agent 
is an antibody, more preferably a monoclonal antibody. 

The present invention also provides, within other aspects, methods for 
monitoring the progression of a cancer in a patient. Such methods comprise the steps 

30 of: (a) contacting a biological sample obtained from a patient at a first point in time 
with a binding agent that binds to a polypeptide as recited above; (b) detecting in the 
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sample an amount of polypeptide that binds to the binding agent; (c) repeating steps (a) 
and (b) using a biological sample obtained from the patient at a subsequent point in 
time; and (d) comparing the amount of polypeptide detected in step (c) with the amount 
detected in step (b) and therefrom monitoring the progression of the cancer in the 
5 patient. 

The present invention further provides, within other aspects, methods for 
determining the presence or absence of a cancer in a patient, comprising the steps of: (a) 
contacting a biological sample obtained from a patient with an oligonucleotide that 
hybridizes to a polynucleotide that encodes a polypeptide of the present invention; (b) 

1 0 detecting in the sample a level of a polynucleotide, preferably mRNA, that hybridizes to 
the oligonucleotide; and (c) comparing the level of polynucleotide that hybridizes to the 
oligonucleotide with a predetermined cut-off Value, and therefrom determining the 
presence or absence of a cancer in the patient. Within certain embodiments, the amount 
of mRNA is detected via polymerase chain reaction using, for example, at least one 

15 oligonucleotide primer that hybridizes to a polynucleotide encoding a polypeptide as 
recited above, or a complement of such a polynucleotide. Within other embodiments, 
the amount of mRNA is detected using a hybridization technique, employing an 
oligonucleotide probe that hybridizes to a polynucleotide that encodes a polypeptide as 
recited above, or a complement of such a polynucleotide. 

20 In related aspects, methods are provided for monitoring the progression 

of a cancer in a patient, comprising the steps of: (a) contacting a biological sample 
obtained from a patient with an oligonucleotide that hybridizes to a polynucleotide that 
encodes a polypeptide of the present invention; (b) detecting in the sample an amount of 
a polynucleotide that hybridizes to the oligonucleotide; (c) repeating steps (a) and (b) 

25 using a biological sample obtained from the patient at a subsequent point in time; and 
(d) comparing the amount of polynucleotide detected in step (c) with the amount 
detected in step (b) and therefrom monitoring the progression of the cancer in the 
patient. 

Within further aspects, the present invention provides antibodies, such as 
30 monoclonal antibodies, that bind to a polypeptide as described above, as well as 
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diagnostic kits comprising such antibodies. Diagnostic kits comprising one or more 
oligonucleotide probes or primers as described above are also provided. 

These and other aspects of the present invention will become apparent 
upon reference to the following detailed description. All references disclosed herein are 
5 hereby incorporated by reference in their entirety as if each was incorporated 
individually. 

SEQUENCE IDENTIFIERS 

SEQ ID NO:l is the determined cDNA sequence for LST-S1-2 
SEQ ID NO:2 is the determined cDNA sequence for LST-S1-28 

10 SEQ ID NO:3 is the determined cDNA sequence for LST-S1-90 
SEQ ID NO:4 is the determined cDNA sequence for LST-S1-144 
SEQ ID NO: 5 is the determined cDNA sequence for LST-S1-133 
SEQ ID NO:6 is the determined cDNA sequence for LST-S1-169 
SEQ ID NO:7 is the determined cDNA sequence for LST-S2-6 

1 5 SEQ ID NO : 8 is the determined cDNA sequence for LST-S2- 1 1 
SEQ ID NO:9 is the determined cDNA sequence for LST-S2-17 
SEQ ID NO: 10 is the determined cDNA sequence for LST-S2-25 
SEQ ID NO: 1 1 is the determined cDNA sequence for LST-S2-39 
SEQ ID NO: 12 is a first determined cDNA sequence for LST-S2-43 

20 SEQ ID NO: 13 is a second determined cDNA sequence for LST-S2-43 
SEQ ID NO: 14 is the determined cDNA sequence for LST-S2-65 
SEQ ID NO: 1 5 is the determined cDNA sequence for LST-S2-68 
SEQ ID NO: 16 is the determined cDNA sequence for LST-S2-72 
SEQ ID NO: 17 is the determined cDNA sequence for LST-S2-74 

25 SEQ ID NO: 1 8 is the determined cDNA sequence for LST-S2-103 
SEQ ID NO: 19 is the determined cDNA sequence for LST-S2-N1-1F 
SEQ ID NO:20 is the determined cDNA sequence for LST-S2-N1-2A 
SEQ ID NO:21 is the determined cDNA sequence for LST-S2-N1-4H 
SEQ ID NO:22 is the determined cDNA sequence for LST-S2-N1-5A 

30 SEQ ID NO:23 is the determined cDNA sequence for LST-S2-N1-6B 
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SEQ ID NO:24 is the determined cDNA sequence for LST-S2-N1-7B 
SEQ ID NO:25 is the determined cDNA sequence for LST-S2-N1-7H 
SEQ ID NO:26 is the determined cDNA sequence for LST-S2-N1-8A 
SEQ ID NO:27 is the determined cDNA sequence for LST-S2-N1-8D 
5 SEQ ID NO:28 is the determined cDNA sequence for LST-S2-N1-9A 
SEQ ID NO:29 is the determined cDNA sequence for LST-S2-N1-9E 
SEQ ID NO:30 is the determined cDNA sequence for LST-S2-N1-10A 
SEQ ID NO:31 is the determined cDNA sequence for LST-S2-N1-10G 
SEQ ID NO:32 is the determined cDNA sequence for LST-S2-N1-1 1 A 

10 SEQ ID NO:33 is the determined cDNA sequence for LST-S2-N1-12C 
SEQ ID NO:34 is the determined cDNA sequence for LST-S2-N1-12E 
SEQ ID NO:35 is the determined cDNA sequence for LST-S2-B1-3D 
SEQ ID NO:36 is the determined cDNA sequence for LST-S2-B1-6C 
SEQ ID NO:37 is the determined cDNA sequence for LST-S2-B1-5D 

15 SEQ ID NO:38 is the determined cDNA sequence for LST-S2-B1-5F 
SEQ ID NO:39 is the determined cDNA sequence for LST-S2-B1-6G 
SEQ ID NO:40 is the determined cDNA sequence for LST-S2-B1-8A 
SEQ ID NO:41 is the determined cDNA sequence for LST-S2-B1-8D 
SEQ ID NO:42 is the determined cDNA sequence for LST-S2-B1-10A 

20 SEQ ID NO:43 is the determined cDNA sequence for LST-S2-B1-9B 
SEQ ID NO:44 is the determined cDNA sequence for LST-S2-B1-9F 
SEQ ID NO:45 is the determined cDNA sequence for LST-S2-B1-12D 
SEQ ID NO:46 is the determined cDNA sequence for LST-S2-I2-2B 
SEQ ID NO:47 is the determined cDNA sequence for LST-S2-I2-5F 

25 SEQ ID NO:48 is the determined cDNA sequence for LST-S2-I2-6B 
SEQ ID NO:49 is the determined cDNA sequence for LST-S2-I2-7F 
SEQ ID NO: 50 is the determined cDNA sequence for LST-S2-I2-8G 
SEQ ID NO:51 is the determined cDNA sequence for LST-S2-I2-9E 
SEQ ID NO:52 is the determined cDNA sequence for LST-S2-I2-12B 

30 SEQ ID NO:53 is the determined cDNA sequence for LST-S2-H2-2C 
SEQ ID NO:54 is the determined cDNA sequence for LST-S2-H2-1G 
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SEQ ID NO:55 is the determined cDNA sequence for LST-S2-H2-4G 
SEQ ID NO:56 is the determined cDNA sequence for LST-S2-H2-3H 
SEQ ID NO:57 is the determined cDNA sequence for LST-S2-H2-5G 
SEQ ID NO:58 is the determined cDNA sequence for LST-S2-H2-9B 
5 SEQ ID NO:59 is the determined cDNA sequence for LST-S2-H2-10H 
SEQ ID NO:60 is the determined cDNA sequence for LST-S2-H2-12D 
SEQ ID NO: 61 is the determined cDNA sequence for LST-S3-2 
SEQ ID NO: 62 is the determined cDNA sequence for LST-S3-4 
SEQ ID NO: 63 is the determined cDNA sequence for LST-S3-7 

10 SEQ ID NO: 64 is the determined cDNA sequence for LST-S3-8 
SEQ ID NO: 65 is the determined cDNA sequence for LST-S3-12 
SEQ ID NO: 66 is the determined cDNA sequence for LST-S3-13 
SEQ ID NO: 67 is the determined cDNA sequence for LST-S3-14 
SEQ ID NO: 68 is the determined cDNA sequence for LST-S3-16 

15 SEQ ID NO: 69 is the determined cDNA sequence for LST-S3-21 
SEQ ID NO: 70 is the determined cDNA sequence for LST-S3-22 
SEQ ID NO: 71 is the determined cDNA sequence for LST-S1-7 
SEQ ID NO: 72 is the determined cDNA sequence for LST-S1-A-1E 
SEQ ID NO: 73 is the determined cDNA sequence for LST-S1-A-1G 

20 SEQ ID NO: 74 is the determined cDNA sequence for LST-S 1 -A-3E 
SEQ ID NO: 75 is the determined cDNA sequence for LST-S 1-A-4E 
SEQ ID NO: 76 is the determined cDNA sequence for LST-S 1-A-6D 
SEQ ID NO: 77 is the determined cDNA sequence for LST-S 1-A-8D 
SEQ ID NO: 78 is the determined cDNA sequence for LST-S1-A-10A 

25 SEQ ID NO: 79 is the determined cDNA sequence for LST-S1-A-10C 
SEQ ID NO: 80 is the determined cDNA sequence for LST-S 1-A-9D 
SEQ ID NO: 81 is the determined cDNA sequence for LST-S1-A-10D 
SEQ ID NO: 82 is the determined cDNA sequence for LST-S 1-A-9H 
SEQ ID NO: 83 is the determined cDNA sequence for LST-S1-A-1 ID 

30 SEQ ID NO: 84 is the determined cDNA sequence for LST-S1-A-12D 
SEQ ID NO: 85 is the determined cDNA sequence for LST-S1-A-1 IE 
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SEQ ID NO: 86 is the determined cDNA sequence for LST-S1-A-12E 
SEQ ID NO: 87 is the determined cDNA sequence for L513S (T3). 
SEQ ID NO: 88 is the determined cDNA sequence for L513S contig 1. 
SEQ ID NO: 89 is a first determined cDNA sequence for L514S. 
5 SEQ ID NO: 90 is a second determined cDNA sequence for L514S. 
SEQ ID NO: 91 is a first determined cDNA sequence for L516S. 
SEQ ID NO: 92 is a second determined cDNA sequence for L516S. 
SEQ ID NO: 93 is the determined cDNA sequence for L517S. 

SEQ ID NO: 94 is the extended cDNA sequence for LST-S1-169 (also known as 
10 L519S). 

SEQ ID NO: 95 is a first determined cDNA sequence for L520S. 

SEQ ID NO: 96 is a second determined cDNA sequence for L520S. 

SEQ ID NO: 97 is a first determined cDNA sequence for L521S. 

SEQ ID NO: 98 is a second determined cDNA sequence for L521S. 
1 5 SEQ ID NO : 99 is the determined cDNA sequence for L522S. 

SEQ ID NO: 100 is the determined cDNA sequence for L523S. 

SEQ ID NO: 101 is the determined cDNA sequence for L524S. 

SEQ ID NO: 102 is the determined cDNA sequence for L525S. 

SEQ ID NO: 103 is the determined cDNA sequence for L526S. 
20 SEQ ID NO: 1 04 is the determined cDNA sequence for L527S. 

SEQ ID NO: 105 is the determined cDNA sequence for L528S. 

SEQ ID NO: 106 is the determined cDNA sequence for L529S. 

SEQ ID NO: 107 is a first determined cDNA sequence for L530S. 

SEQ ID NO: 108 is a second determined cDNA sequence for L530S. 
25 SEQ ID NO: 1 09 is the determined full-length cDNA sequence for L53 1 S short form 

SEQ ID NO: 1 10 is the amino acid sequence encoded by SEQ ID NO: 109. 

SEQ ID NO: 1 1 1 is the determined full-length cDNA sequence for L53 IS long form 

SEQ ID NO: 1 12 is the amino acid sequence encoded by SEQ ID NO: 111. 

SEQ ID NO: 1 13 is the determined full-length cDNA sequence for L520S. 
30 SEQ ID NO: 1 14 is the amino acid sequence encoded by SEQ ID NO: 1 13. 

SEQ ID NO: 1 15 is the determined cDNA sequence for contig 1. 
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SEQ ID NO: 1 16 is the determined cDNA sequence for contig 3. 
SEQ ID NO: 1 17 is the determined cDNA sequence for contig 4. 
SEQ ID NO: 118 is the determined cDNA sequence for contig 5. 
SEQ ID NO: 1 19 is the determined cDNA sequence for contig 7. 
5 SEQ ID NO: 120 is the determined cDNA sequence for contig 8. 
SEQ ID NO: 121 is the determined cDNA sequence for contig 9. 
SEQ ID NO: 122 is the determined cDNA sequence for contig 10. 
SEQ ID NO: 123 is the determined cDNA sequence for contig 12. 
SEQ ID NO: 124 is the detennined cDNA sequence for contig 1 1 . 
10 SEQ ID NO: 125 is the detennined cDNA sequence for contig 13 (also known as 
L761P). 

SEQ ID NO: 126 is the determined cDNA sequence for contig 15. 

SEQ ID NO: 127 is the determined cDNA sequence for contig 16. 

SEQ ID NO: 128 is the determined cDNA sequence for contig 17. 
1 5 SEQ ID NO: 129 is the determined cDNA sequence for contig 1 9. 

SEQ ID NO: 130 is the determined cDNA sequence for contig 20. 

SEQ ID NO: 131 is the determined cDNA sequence for contig 22. 

SEQ ID NO: 132 is the determined cDNA sequence for contig 24. 

SEQ ID NO: 133 is the determined cDNA sequence for contig 29. 
20 SEQ ID NO: 1 34 is the determined cDNA sequence for contig 3 1 . 

SEQ ID NO: 135 is the determined cDNA sequence for contig 33. 

SEQ ID NO: 136 is the determined cDNA sequence for contig 38. 

SEQ ID NO: 137 is the determined cDNA sequence for contig 39. 

SEQ ID NO: 138 is the determined cDNA sequence for contig 41. 
25 SEQ ID NO: 1 39 is the determined cDNA sequence for contig 43 . 

SEQ ID NO: 140 is the determined cDNA sequence for contig 44. 

SEQ ID NO: 141 is the determined cDNA sequence for contig 45. 

SEQ ID NO: 142 is the detennined cDNA sequence for contig 47. 

SEQ ID NO: 143 is the determined cDNA sequence for contig 48. 
30 SEQ ID NO: 144 is the determined cDNA sequence for contig 49. 

SEQ ID NO: 145 is the determined cDNA sequence for contig 50. 
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SEQ ID NO: 161 is the amino acid sequence encoded by SEQ ID NO: 160. 

SEQ ID NO: 162 is the determined cDNA sequence for L515S. 
20 SEQ ID NO : 1 63 is the full-length cDNA sequence of a first variant of L524S . 

SEQ ID NO: 164 is the full-length cDNA sequence of a second variant of L524S. 

SEQ ID NO: 165 is the amino acid sequence encoded by SEQ ID NO: 163. 

SEQ ID NO: 166 is the amino acid sequence encoded by SEQ ID NO: 164. 

SEQ ID NO: 167 is the full-length cDNA sequence of a first variant of L762P. 
25 SEQ ID NO : 1 68 is the full-length cDNA sequence of a second variant of L762P . 

SEQ ID NO: 169 is the amino acid sequence encoded by SEQ ID NO: 167. 

SEQ ID NO: 170 is the amino acid sequence encoded by SEQ ID NO: 168. 

SEQ ID NO: 171 is the full-length cDNA sequence for L773P (also referred to as contig 

56). 

30 SEQ ID NO: 172 is the amino acid sequence encoded by SEQ ID NO: 171. 
SEQ ID NO: 173 is an extended cDNA sequence for L519S. 
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SEQ ID NO: 205 is the determined cDNA sequence for LST-sub6-12e. 

SEQ ID NO: 206 is the determined cDNA sequence for LST-sub6-12f. 

SEQ ID NO: 207 is the determined cDNA sequence for LST-sub6-l lg. 

SEQ ID NO: 208 is the determined cDNA sequence for LST-sub6-12g. 
5 SEQ ID NO: 209 is the determined cDNA sequence for LST-sub6-12h. 

SEQ ID NO: 210 is the determined cDNA sequence for LST-sub6-II-la. 

SEQ ID NO: 21 1 is the determined cDNA sequence for LST-sub6-II-2b. 

SEQ ID NO: 212 is the determined cDNA sequence for LST-sub6-II-2g. 

SEQ ID NO: 213 is the determined cDNA sequence for LST-sub6-II-lh. 
10 SEQ ID NO: 214 is the determined cDNA sequence for LST-sub6-II-4a. 

SEQ ID NO: 215 is the determined cDNA sequence for LST-sub6-II-4b. 

SEQ ID NO: 216 is the determined cDNA sequence for LST-sub6-II-3e. 

SEQ ID NO: 217 is the determined cDNA sequence for LST-sub6-II-4f. 

SEQ ID NO: 218 is the determined cDNA sequence for LST-sub6-II-4g. 
1 5 SEQ ID NO : 2 1 9 is the determined cDNA sequence for LST-sub6-II-4h. 

SEQ ID NO: 220 is the determined cDNA sequence for LST-sub6-II-5c. 

SEQ ID NO: 221 is the determined cDNA sequence for LST-sub6-II-5e. 

SEQ ID NO: 222 is the determined cDNA sequence for LST-sub6-II-6f. 

SEQ ID NO: 223 is the determined cDNA sequence for LST-sub6-II-5g. 
20 SEQ ID NO : 224 is the determined cDNA sequence for LST-sub6-II-6g. 

SEQ ID NO: 225 is the amino acid sequence for L528S. 

SEQ ID NO: 226-251 are synthetic peptides derived from L762P. 

SEQ ID NO: 252 is the expressed amino acid sequence of L514S. 

SEQ ID NO: 253 is the DNA sequence corresponding to SEQ ID NO: 252. 
25 SEQ ID NO : 254 is the DNA sequence of a L762P expression construct. 

SEQ ID NO: 255 is the determined cDNA sequence for clone 23785. 

SEQ ID NO: 256 is the determined cDNA sequence for clone 23786. 

SEQ ID NO: 257 is the determined cDNA sequence for clone 23788. 

SEQ ID NO: 258 is the determined cDNA sequence for clone 23790. 
30 SEQ ID NO: 259 is the determined cDNA sequence for clone 23793. 

SEQ ID NO: 260 is the determined cDNA sequence for clone 23794. 
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SEQ ID NO: 261 is the determined cDNA sequence for clone 23795. 

SEQ ID NO: 262 is the determined cDNA sequence for clone 23796. 

SEQ ID NO: 263 is the determined cDNA sequence for clone 23797. 

SEQ ID NO: 264 is the determined cDNA sequence for clone 23798. 
5 SEQ ID NO: 265 is the determined cDNA sequence for clone 23799. 

SEQ ID NO: 266 is the determined cDNA sequence for clone 23800. 

SEQ ID NO: 267 is the determined cDNA sequence for clone 23802. 

SEQ ID NO: 268 is the determined cDNA sequence for clone 23803. 

SEQ ID NO: 269 is the determined cDNA sequence for clone 23804. 
1 0 SEQ ID NO : 270 is the determined cDNA sequence for clone 23 805 . 

SEQ ID NO: 271 is the determined cDNA sequence for clone 23806. 

SEQ ID NO: 272 is the determined cDNA sequence for clone 23807. 

SEQ ID NO: 273 is the determined cDNA sequence for clone 23808. 

SEQ ID NO: 274 is the determined cDNA sequence for clone 23809. 
15 SEQ ID NO: 275 is the determined cDNA sequence for clone 23810. 

SEQ ID NO: 276 is the determined cDNA sequence for clone 2381 1. 

SEQ ID NO: 277 is the determined cDNA sequence for clone 23812. 

SEQ ID NO: 278 is the determined cDNA sequence for clone 23813. 

SEQ ID NO: 279 is the determined cDNA sequence for clone 23815. 
20 SEQ ID NO: 280 is the determined cDNA sequence for clone 25298. 

SEQ ID NO: 281 is the determined cDNA sequence for clone 25299. 

SEQ ID NO: 282 is the determined cDNA sequence for clone 25300. 

SEQ ID NO: 283 is the determined cDNA sequence for clone 25301 

SEQ ID NO: 284 is the determined cDNA sequence for clone 25304 
25 SEQ ID NO: 285 is the determined cDNA sequence for clone 25309. 

SEQ ID NO: 286 is the determined cDNA sequence for clone 25312. 

SEQ ID NO: 287 is the determined cDNA sequence for clone 25317. 

SEQ ID NO:288 is the determined cDNA sequence for clone 25321. 

SEQ ID NO:289 is the determined cDNA sequence for clone 25323. 
30 SEQ ID NO:290 is the determined cDNA sequence for clone 25327. 

SEQ ID NO:291 is the determined cDNA sequence for clone 25328. 
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SEQ ID NO:292 is the determined cDNA sequence for clone 25332. 

SEQ ID NO:293 is the determined cDNA sequence for clone 25333. 

SEQ ID NO:294 is the determined cDNA sequence for clone 25336. 

SEQ ID NO:295 is the determined cDNA sequence for clone 25340. 
5 SEQ ID NO:296 is the determined cDNA sequence for clone 25342. 

SEQ ID NO:297 is the determined cDNA sequence for clone 25356. 

SEQ ID NO:298 is the determined cDNA sequence for clone 25357. 

SEQ ID NO:299 is the determined cDNA sequence for clone 25361. 

SEQ ID NO-.300 is the determined cDNA sequence for clone 25363. 
1 0 SEQ ID NO:301 is the determined cDNA sequence for clone 25397. 

SEQ ID NO:302 is the determined cDNA sequence for clone 25402. 

SEQ ID NO:303 is the determined cDNA sequence for clone 25403. 

SEQ ID NO:304 is the determined cDNA sequence for clone 25405. 

SEQ ID NO:305 is the determined cDNA sequence for clone 25407. 
1 5 SEQ ID NO:306 is the determined cDNA sequence for clone 25409. 

SEQ ID NO:307 is the determined cDNA sequence for clone 25396. 

SEQ ID NO:308 is the determined cDNA sequence for clone 25414. 

SEQ ID NO.-309 is the determined cDNA sequence for clone 25410. 

SEQ ID NO:310 is the determined cDNA sequence for clone 25406. 
20 SEQ ID NO:3 1 1 is the determined cDNA sequence for clone 25306. 

SEQ ID NO:3 12 is the determined cDNA sequence for clone 25362. 

SEQ ID NO:313 is the determined cDNA sequence for clone 25360. 

SEQ ID NO:314 is the determined cDNA sequence for clone 25398. 

SEQ ID NO:3 15 is the determined cDNA sequence for clone 25355. 
25 SEQ ID NO:3 16 is the determined cDNA sequence for clone 2535 1 . 

SEQ ID NO:317 is the determined cDNA sequence for clone 25331. 

SEQ ID NO:3 18 is the determined cDNA sequence for clone 25338. 

SEQ ID NO:319 is the determined cDNA sequence for clone 25335. 

SEQ ID NO:320 is the determined cDNA sequence for clone 25329. 
30 SEQ ID NO:32 1 is the determined cDNA sequence for clone 25324. 

SEQ ID NO:322 is the determined cDNA sequence for clone 25322. 
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SEQ ID NO:323 is the determined cDNA sequence for clone 25319. 
SEQ ID NO:324 is the determined cDNA sequence for clone 25316. 
SEQ ID NO:325 is the determined cDNA sequence for clone 2531 1. 
SEQ ID NO:326 is the determined cDNA sequence for clone 25310. 
5 SEQ ID NO:327 is the determined cDNA sequence for clone 25302. 
SEQ ID NO:328 is the determined cDNA sequence for clone 25315. 
SEQ ID NO-.329 is the determined cDNA sequence for clone 25308. 
SEQ ID NO:330 is the determined cDNA sequence for clone 25303. 
SEQ ID NO:33 1-337 are the cDNA sequences of isoforms of the p53 tumor suppressor 
1 0 homologue, p63 (also referred to as L530S). 

SEQ ID NO:338-344 are the amino acid sequences encoded by SEQ ID NO:33 1-337, 
respectively 

SEQ ID NO:345 is a second cDNA sequence for the antigen L763P. 
SEQ ID NO: 3 46 is the amino acid sequence encoded by the sequence of SEQ ID NO: 
15 345. 

SEQ ID NO:347 is a determined full-length cDNA sequence for L523S. 
SEQ ID NO:348 is the amino acid sequence encoded by SEQ ID NO: 347. 
SEQ ID NO:349 is the cDNA sequence encoding the N-terminal portion of L773P. 
SEQ ID NO:350 is the amino acid sequence of the N-terminal portion of L773P. 
20 SEQ ID NO:351 is the DNA sequence for a fusion of Ral2 and the N-terminal portion 
ofL763P. 

SEQ ID NO:352 is the amino acid sequence of the fusion of Ral2 and the N-terminal 
portion ofL763P. 

SEQ ID NO:353 is the DNA sequence for a fusion of Ral2 and the C-terminal portion 
25 ofL763P. 

SEQ ID NO:354 is the amino acid sequence of the fusion of Ral2 and the C-terminal 
portion ofL763P. 
SEQ ID NO:355 is a primer. 
SEQ ID NO:356 is a primer. 
30 SEQ ID NO:357 is the protein sequence of expressed recombinant L762P. 
SEQ ID NO:358 is the DNA sequence of expressed recombinant L762P. 
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SEQ ID NO:359 is a primer. 
SEQ ID NO:360 is a primer. 

SEQ ID NO:361 is the protein sequence of expressed recombinant L773P A. 
SEQ ID NO:362 is the DNA sequence of expressed recombinant L773P A. 
5 SEQ ID NO:363 is an epitope derived from clone L773P polypeptide. 

SEQ ID NO:364 is a polynucleotide encoding the polypeptide of SEQ ID NO:363. 
SEQ ID NO:365 is an epitope derived from clone L773P polypeptide. 
SEQ ID NO:366 is a polynucleotide encoding the polypeptide of SEQ ID NO:365. 
SEQ ID NO:367 is an epitope consisting of amino acids 571-590 of SEQ ID NO:161, 
10 clone L762P. 

SEQ ID NO:368 is the full-length DNA sequence for contig 13 (SEQ ID NO:125), also 
referred to as L761P. 

SEQ ID NO:369 is the protein sequence encoded by the DNA sequence of SEQ ID 
NO:368. 

15 SEQ ID NO:370 is an L762P DNA sequence from nucleotides 2071-2130. 

SEQ ID NO:371 is an L762P DNA sequence from nucleotides 1441-1500. 

SEQ ID NO:372 is an L762P DNA sequence from nucleotides 1936-1955. 

SEQ ID NO:373 is an L762P DNA sequence from nucleotides 2620-2679. 

SEQ ID NO:374 is an L762P DNA sequence from nucleotides 1801-1860. 
20 SEQ ID NO:375 is an L762P DNA sequence from nucleotides 1531-1591. 

SEQ ID NO:376 is the amino acid sequence of the L762P peptide encoded by SEQ ID 

NO:373. 

SEQ ID NO:377 is the amino acid sequence of the L762P peptide encoded by SEQ ID 
NO:370. 

25 SEQ ID NO:378 is the amino acid sequence of the L762P peptide encoded by SEQ ID 
NO:372. 

SEQ ID NO:379 is the amino acid sequence of the L762P peptide encoded by SEQ ID 
NO:374. 

SEQ ID NO-.380 is the amino acid sequence of the L762P peptide encoded by SEQ ID 
30 NO:371. 
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SEQ ID NO:381 is the amino acid sequence of the L762P peptide encoded by SEQ ID 
NO:375. 

SEQ ID NO: 3 82 is the amino acid sequence of an epitope of L762P. 

SEQ ID NO:383-386 are PCR primers. 
5 SEQ ID NO:387-395 are the amino acid sequences of L773P peptides. 

SEQ ID NO:396-419 are the amino acid sequences of L523S peptides. 

SEQ ID NO:420 is the determined cDNA sequence for clone #19014. 

SEQ ID NO:421 is the forward primer PDM-278 for the L514S-13160 coding region. 

SEQ ID NO:422 is the reverse primer PDM-278 for the L514S-13160 coding region. 
10 SEQ ID NO:423 is the amino acid sequence for the expressed recombinant L514S. 

SEQ ID NO:424 is the DNA coding sequence for the recombinant L5 14S. 

SEQ ID NO: 425 is the forward primer PDM-414 for the L523S coding region. 

SEQ ID NO:426 is the reverse primer PDM-414 for the L523S coding region. 

SEQ ID NO:427 is the amino acid sequence for the expressed recombinant L523S. 
1 5 SEQ ID NO:428 is the DNA coding sequence for the recombinant L523S. 

SEQ ID NO:429 is the reverse primer PDM-279 for the L762PA coding region. 

SEQ ID NO:430 is the amino acid sequence for the expressed recombinant L762PA. 

SEQ ID NO:431 is the DNA coding sequence for the recombinant L762PA. 

SEQ ID NO:432 is the reverse primer PDM-300 for the L773P coding region. 
20 SEQ ID NO:433 is the amino acid sequence of the expressed recombinant L773P. 

SEQ ID NO:434 is the DNA coding sequence for the recombinant L773P. 

SEQ ID NO:435 is the forward primer for TCR ValphaS. 

SEQ ID NO:436 is the reverse primer for TCR Valpha8. 

SEQ ID NO:437 is the forward primer for TCR Vbeta8. 
25 SEQ ID NO :43 8 is the reverse primer for TCR Vbeta8 . 

SEQ ID NO:439 is the TCR Vaipha DNA sequence of the TCR clone specific for the 

lung antigen L762P. 

SEQ ID NO:440 is the TCR Vbeta DNA sequence of the TCR clone specific for the 
lung antigen L762P. 
30 SEQ ID NO:441 is the amino acid sequence of L763 peptide #2684. 
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SEQ ID NO:442 is the predicted full-length cDNA for the cloned partial sequence of 
clone L529S (SEQ ID NO:106). 

SEQ ID NO:443 is the deduced amino acid sequence encoded by SEQ ID NO:442. 
SEQ ID NO:444 is the forward primer PDM-734 for the coding region of clone L523S. 
5 SEQ ID NO:445 is the reverse primer PDM-735 for the coding region of clone L523S. 
SEQ ID NO:446 is the amino acid sequence for the expressed recombinant L523S. 
SEQ ID NO:447 is the DNA coding sequence for the recombinant L523S. 
SEQ ID NO:448 is another forward primer PDM-733 for the coding region of clone 
L523S. 

1 0 SEQ ID NO:449 is the amino acid sequence for a second expressed recombinant L523S. 
SEQ ID NO:450 is the DNA coding sequence for a second recombinant L523S. 
SEQ ID NO:451 corresponds to amino acids 86-110, an epitope of L514S-specific in 
the generation of antibodies. 

SEQ ID NO:452 corresponds to amino acids 21-45, an epitope of L514S-specific in the 
1 5 generation of antibodies. 

SEQ ID NO:453 corresponds to amino acids 121-135, an epitope of L514S-specific in 
the generation of antibodies. 

SEQ ID NO:454 corresponds to amino acids 440-460, an epitope of L523S-specific in 
the generation of antibodies. 
20 SEQ ID NO:455 corresponds to amino acids 156-175, an epitope of L523S-specific in 
the generation of antibodies. 

SEQ ID NO:456 corresponds to amino acids 326-345, an epitope of L523S-specific in 
the generation of antibodies. 

SEQ ID NO:457 corresponds to amino acids 40-59, an epitope of L523S-specific in the 
25 generation of antibodies. 

SEQ ID NO-.458 corresponds to amino acids 80-99, an epitope of L523S-specific in the 
generation of antibodies. 

SEQ ID NO:459 corresponds to amino acids 160-179, an epitope of L523S-specific in 
the generation of antibodies. 
30 SEQ ID NO:460 corresponds to amino acids 180-199, an epitope of L523S -specific in 
the generation of antibodies. 
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SEQ ID NO:461 corresponds to amino acids 320-339, an epitope of L523S-specific in 
the generation of antibodies. 

SEQ ID NO:462 corresponds to amino acids 340-359, an epitope of L523S-specific in 
the generation of antibodies. 
5 SEQ ID NO:463 corresponds to amino acids 370-389, an epitope of L523S-specific in 
the generation of antibodies. 

SEQ ID NO:464 corresponds to amino acids 380-399, an epitope of L523S-specific in 
the generation of antibodies. 

SEQ ID NO:465 corresponds to amino acids 37-55, an epitope of L523S-recognized by 
1 0 the L523S-specific CTL line 6B1 . 

SEQ ID NO:466 corresponds to amino acids 41-51, the mapped antigenic epitope of 
L523S-recognized by the L523S-specific CTL line 6B1 . 

SEQ ID NO.-467 corresponds to the DNA sequence which encodes SEQ ID NO:466. 
SEQ ID NO:468 corresponds to the amino acids of peptide 16, 17 of hL523S. 
1 5 SEQ ID NO:469 corresponds to the amino acids of peptide 16, 17 of mL523S 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention is directed generally to compositions and their use 
in the therapy and diagnosis of cancer, particularly lung cancer. As described further 
below, illustrative compositions of the present invention include, but are not restricted 
20 to, polypeptides, particularly immunogenic polypeptides, polynucleotides encoding such 
polypeptides, antibodies and other binding agents, antigen presenting cells (APCs) and 
immune system cells (e.g., T cells). 

The practice of the present invention will employ, unless indicated 
specifically to the contrary, conventional methods of virology, immunology, 

25 microbiology, molecular biology and recombinant DNA techniques within the skill of 
the art, many of which are described below for the purpose of illustration. Such 
techniques are explained fully in the literature. See, e.g., Sambrook, et al. Molecular 
Cloning: A Laboratory Manual (2nd Edition, 1989); Maniatis et al. Molecular Cloning: 
A Laboratory Manual (1982); DNA Cloning: A Practical Approach, vol. I & II (D. 

30 Glover, ed.); Oligonucleotide Synthesis (N. Gait, ed., 1984); Nucleic Acid 
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Hybridization (B. Hames & S. Higgins, eds., 1985); Transcription and Translation (B. 
Hames & S. Higgins, eds., 1984); Animal Cell Culture (R. Freshney, ed., 1986); Perbal, 
A Practical Guide to Molecular Cloning (1984). 

All publications, patents and patent applications cited herein, whether 
5 supra or infra, are hereby incorporated by reference in their entirety. 

As used in this specification and the appended claims, the singular forms 
"a," "an" and "the" include plural references unless the content clearly dictates 
otherwise. 

Polypeptide Compositions 

10 As used herein, the term "polypeptide" " is used in its conventional 

meaning, i.e., as a sequence of amino acids. The polypeptides are not limited to a 
specific length of the product; thus, peptides, oligopeptides, and proteins are included 
within the definition of polypeptide, and such terms may be used interchangeably herein 
unless specifically indicated otherwise. This term also does not refer to or exclude post- 

15 expression modifications of the polypeptide, for example, glycosylations, acetylations, 
phosphorylations and the like, as well as other modifications known in the art, both 
naturally occurring and non-naturally occurring. A polypeptide may be an entire 
protein, or a subsequence thereof. Particular polypeptides of interest in the context of 
this invention are amino acid subsequences comprising epitopes, i.e., antigenic 

20 determinants substantially responsible for the immunogenic properties of a polypeptide 
and being capable of evoking an immune response. 

Particularly illustrative polypeptides of the present invention comprise 
those encoded by a polynucleotide sequence set forth in any one of SEQ ID NO: 1-3, 6- 
8, 10-13, 15-27, 29, 30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80- 

25 82, 84, 86-96, 107-109, 111, 113, 125, 127, 128, 129, 131-133, 142, 144, 148-151, 153, 
154, 157, 158, 160, 167, 168, 171, 179, 182, 184-186, 188-191, 193, 194, 198-207, 209, 
210, 213, 214, 217, 220-224, 253-337, 345, 347, 349, 358, 362, 364, 365, 368, 370-375, 
420, 424, 428, 431, 434, 442, 447, 450 and 467, or a sequence that hybridizes under 
moderately stringent conditions, or, alternatively, under highly stringent conditions, to a 

30 polynucleotide sequence set forth in any one of SEQ ID NO:l-3, 6-8, 10-13, 15-27, 29, 
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30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80-82, 84, 86-96, 107- 
109, 111, 113, 125, 127, 128, 129, 131-133, 142, 144, 148-151, 153, 154, 157, 158, 
160, 167, 168, 171, 179, 182, 184-186, 188-191, 193, 194, 198-207, 209, 210, 213, 214, 
217, 220-224, 253-337, 345, 347, 349, 358, 362, 364, 365, 368, 370-375, 420, 424, 428, 
5 431, 434, 442, 447, 450 and 467. Certain illustrative polypeptides of the invention 
comprise amino acid sequences as set forth in any one of SEQ ID NO: 152, 155, 156, 
165, 166, 169, 170, 172, 174, 176, 226-252, 338-344, 346, 350, 357, 361, 363, 365, 
367, 369, 376-382, 387-419, 423, 427, 430, 433, 441, 443, 446, 449, 451-466 and 468- 
469. 

10 The polypeptides of the present invention are sometimes herein referred 

to as lung tumor proteins or lung tumor polypeptides, as an indication that their 
identification has been based at least in part upon their increased levels of expression in 
lung tumor samples. Thus, a "lung tumor polypeptide" or "lung tumor protein," refers 
generally to a polypeptide sequence of the present invention, or a polynucleotide 

1 5 sequence encoding such a polypeptide, that is expressed in a substantial proportion of 
lung tumor samples, for example preferably greater than about 20%, more preferably 
greater than about 30%, and most preferably greater than about 50% or more of lung 
rumor samples tested, at a level that is at least two fold, and preferably at least five fold, 
greater than the level of expression in normal tissues, as determined using a 

20 representative assay provided herein. A lung tumor polypeptide sequence of the 
invention, based upon its increased level of expression in tumor cells, has particular 
utility both as a diagnostic marker as well as a therapeutic target, as further described 
below. 

In certain preferred embodiments, the polypeptides of the invention are 
25 immunogenic, i.e., they react detectably within an immunoassay (such as an ELISA or 
T-cell stimulation assay) with antisera and/or T-cells from a patient with lung cancer. 
Screening for immunogenic activity can be performed using techniques well known to 
the skilled artisan. For example, such screens can be performed using methods such as 
those described in Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring 
30 Harbor Laboratory, 1988. In one illustrative example, a polypeptide may be 
immobilized on a solid support and contacted with patient sera to allow binding of 
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antibodies within the sera to the immobilized polypeptide. Unbound sera may then be 
removed and bound antibodies detected using, for example, 125 I-labeled Protein A. 

As would be recognized by the skilled artisan, immunogenic portions of 
the polypeptides disclosed herein are also encompassed by the present invention. An 
5 "immunogenic portion," as used herein, is a fragment of an immunogenic polypeptide 
of the invention that itself is immunologically reactive (i.e., specifically binds) with the 
B-cells and/or T-cell surface antigen receptors that recognize the polypeptide. 
Immunogenic portions may generally be identified using well known techniques, such 
as those summarized in Paul, Fundamental Immunology, 3rd ed., 243-247 (Raven Press, 

10 1993) and references cited therein. Such techniques include screening polypeptides for 
the ability to react with antigen-specific antibodies, antisera and/or T-cell lines or 
clones. As used herein, antisera and antibodies are "antigen-specific" if they 
specifically bind to an antigen (i.e., they react with the protein in an ELISA or other 
immunoassay, and do not react detectably with unrelated proteins). Such antisera and 

1 5 antibodies may be prepared as described herein, and using well-known techniques. 

In one preferred embodiment, an immunogenic portion of a polypeptide 
of the present invention is a portion that reacts with antisera and/or T-cells at a level that 
is not substantially less than the reactivity of the full-length polypeptide (e.g., in an 
ELISA and/or T-cell reactivity assay). Preferably, the level of immunogenic activity of 

20 the immunogenic portion is at least about 50%, preferably at least about 70% and most 
preferably greater than about 90% of the immunogenicity for the full-length 
polypeptide. In some instances, preferred immunogenic portions will be identified that 
have a level of immunogenic activity greater than that of the corresponding full-length 
polypeptide, e.g., having greater than about 100% or 150% or more immunogenic 

25 activity. 

In certain other embodiments, illustrative immunogenic portions may 
include peptides in which an N-terminal leader sequence and/or transmembrane domain 
have been deleted. Other illustrative immunogenic portions will contain a small N- 
and/or C-terminal deletion (e.g., 1-30 amino acids, preferably 5-15 amino acids), 
30 relative to the mature protein. 
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In another embodiment, a polypeptide composition of the invention may 
also comprise one or more polypeptides that are immunologically reactive with T cells 
and/or antibodies generated against a polypeptide of the invention, particularly a 
polypeptide having an amino acid sequence disclosed herein, or to an immunogenic 
5 fragment or variant thereof. 

In another embodiment of the invention, polypeptides are provided that 
comprise one or more polypeptides that are capable of eliciting T cells and/or antibodies 
that are immunologically reactive with one or more polypeptides described herein, or 
one or more polypeptides encoded by contiguous nucleic acid sequences contained in 

10 the polynucleotide sequences disclosed herein, or immunogenic fragments or variants 
thereof, or to one or more nucleic acid sequences which hybridize to one or more of 
these sequences under conditions of moderate to high stringency. 

The present invention, in another aspect, provides polypeptide fragments 
comprising at least about 5, 10, 15, 20, 25, 50, or 100 contiguous amino acids, or more, 

15 including all intermediate lengths, of a polypeptide compositions set forth herein, such 
as those set forth in SEQ ID NO:152, 155, 156, 165, 166, 169, 170, 172, 174, 176, 226- 
252, 338-344, 346, 350, 357, 361, 363, 365, 367, 369, 376-382 and 387-419, 441, 443, 
446, 449, 451-466 and 468-469, or those encoded by a polynucleotide sequence set 
forth in a sequence of SEQ ID NO:l-3, 6-8, 10-13, 15-27, 29, 30, 32, 34-49, 51, 52, 54, 

20 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80-82, 84, 86-96, 107-109, 111, 113, 125, 127, 
128, 129, 131-133, 142, 144, 148-151, 153, 154, 157, 158, 160, 167, 168, 171, 179, 
182, 184-186, 188-191, 193, 194, 198-207, 209, 210, 213, 214, 217, 220-224, 253-337, 
345, 347, 349, 358, 362, 364, 365, 368, 370-375, 420, 424, 428, 431, 434, 442, 447, 450 
and 467. 

25 In another aspect, the present invention provides variants of the 

polypeptide compositions described herein. Polypeptide variants generally 
encompassed by the present invention will typically exhibit at least about 70%, 75%, 
80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% or more identity 
(determined as described below), along its length, to a polypeptide sequences set forth 

30 herein. 
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In one preferred embodiment, the polypeptide fragments and variants 
provide by the present invention are immunologically reactive with an antibody and/or 
T-cell that reacts with a full-length polypeptide specifically set for the herein. 

In another preferred embodiment, the polypeptide fragments and variants 
5 provided by the present invention exhibit a level of immunogenic activity of at least 
about 50%, preferably at least about 70%, and most preferably at least about 90% or 
more of that exhibited by a full-length polypeptide sequence specifically set forth 
herein. 

A polypeptide "variant," as the term is used herein, is a polypeptide that 
10 typically differs from a polypeptide specifically disclosed herein in one or more 
substitutions, deletions, additions and/or insertions. Such variants may be naturally 
occurring or may be synthetically generated, for example, by modifying one or more of 
the above polypeptide sequences of the invention and evaluating their immunogenic 
activity as described herein and/or using any of a number of techniques well known in 
15 the art. 

For example, certain illustrative variants of the polypeptides of the 
invention include those in which one or more portions, such as an N-terminal leader 
sequence or transmembrane domain, have been removed. Other illustrative variants 
include variants in which a small portion {e.g., 1-30 amino acids, preferably 5-15 amino 

20 acids) has been removed from the N- and/or C-terminal of the mature protein. 

In many instances, a variant will contain conservative substitutions. A 
"conservative substitution" is one in which an amino acid is substituted for another 
amino acid that has similar properties, such that one skilled in the art of peptide 
chemistry would expect the secondary structure and hydropathic nature of the 

25 polypeptide to be substantially unchanged. As described above, modifications may be 
made in the structure of the polynucleotides and polypeptides of the present invention 
and still obtain a functional molecule that encodes a variant or derivative polypeptide 
with desirable characteristics, e.g., with immunogenic characteristics. When it is 
desired to alter the amino acid sequence of a polypeptide to create an equivalent, or 

30 even an improved, immunogenic variant or portion of a polypeptide of the invention, 
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one skilled in the art will typically change one or more of the codons of the encoding 
DNA sequence according to Table 1. 

For example, certain amino acids may be substituted for other amino 
acids in a protein structure without appreciable loss of interactive binding capacity with 
5 structures such as, for example, antigen-binding regions of antibodies or binding sites 
on substrate molecules. Since it is the interactive capacity and nature of a protein that 
defines that protein's biological functional activity, certain amino acid sequence 
substitutions can be made in a protein sequence, and, of course, its underlying DNA 
coding sequence, and nevertheless obtain a protein with like properties. It is thus 
10 contemplated that various changes may be made in the peptide sequences of the 
disclosed compositions, or corresponding DNA sequences which encode said peptides 
without appreciable loss of their biological utility or activity. 
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Table 1 



Amino Acids Codons 



Alanine 


Ala 


A 


GCA 


GCC 


GCG 


GCU 






Cysteine 


Cys 


C 


UGC 


UGU 










Aspartic acid 


Asp 


D 


GAC 


GAU 










Glutamic acid 


Glu 


E 


GAA 


GAG 










Phenylalanine 


Phe 


F 


UUC 


uuu 










Glycine 


Gly 


G 


GGA 


GGC 


GGG 


GGU 






Histidine 


His 


H 


CAC 


CAU 










Isoleucine 


He 


I 


AUA 


AUC 


AUU 








Lysine 


Lys 


K 


AAA 


AAG 










Leucine 


Leu 


L 


UUA 


UUG 


CUA 


cue 


CUG 


CUU 


Methionine 


Met 


M 


AUG 












Asparagine 


Asn 


N 


AAC 


AAU 










Proline 


Pro 


P 


CCA 


CCC 


CCG 


ecu 






Glutamine 


Gin 


Q 


CAA 


CAG 










Arginine 


Arg 


R 


AGA 


AGG 


CGA 


CGC 


CGG 


CGU 


Serine 


Ser 


s 


AGC 


AGU 


UCA 


UCC 


UCG 


UCU 


Threonine 


Thr 


T 


ACA 


ACC 


ACG 


ACU 






Valine 


Val 


V 


GUA 


GUC 


GUG 


GUU 






Tryptophan 


Trp 


w 


UGG 












Tyrosine 


Tyr 


Y 


UAC 


UAU 











In making such changes, the hydropathic index of amino acids may be 
considered. The importance of the hydropathic amino acid index in conferring 
5 interactive biologic function on a protein is generally understood in the art (Kyte and 
Doolittle, 1982, incorporated herein by reference). It is accepted that the relative 
hydropathic character of the amino acid contributes to the secondary structure of the 
resultant protein, which in turn defines the interaction of the protein with other 
molecules, for example, enzymes, substrates, receptors, DNA, antibodies, antigens, and 
10 the like. Each amino acid has been assigned a hydropathic index on the basis of its 
hydrophobicity and charge characteristics (Kyte and Doolittle, 1982). These values are: 
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isoleucine (+4.5); valine (+4.2); leucine (+3.8); phenylalanine (+2.8); cysteine/cystine 
(+2.5); methionine (+1.9); alanine (+1.8); glycine (-0.4); threonine (-0.7); serine (-0.8); 
tryptophan (-0.9); tyrosine (-1.3); proline (-1.6); histidine (-3.2); glutamate (-3.5); 
glutamine (-3.5); aspartate (-3.5); asparagine (-3.5); lysine (-3.9); and arginine (-4.5). 
5 It is known in the art that certain amino acids may be substituted by other 

amino acids having a similar hydropathic index or score and still result in a protein with 
similar biological activity, i.e. still obtain a biological functionally equivalent protein. 
In making such changes, the substitution of amino acids whose hydropathic indices are 
within ±2 is preferred, those within +1 are particularly preferred, and those within ±0.5 

10 are even more particularly preferred. It is also understood in the art that the substitution 
of like amino acids can be made effectively on the basis of hydrophilicity. U. S. Patent 
4,554,101 (specifically incorporated herein by reference in its entirety), states that the 
greatest local average hydrophilicity of a protein, as governed by the hydrophilicity of 
its adjacent amino acids, correlates with a biological property of the protein. 

15 As detailed in U. S. Patent 4,554, 1 0 1 , the following hydrophilicity values 

have been assigned to amino acid residues: arginine (+3.0); lysine (+3.0); aspartate 
(+3.0 ± 1); glutamate (+3.0 ± 1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); 
glycine (0); threonine (-0.4); proline (-0.5 + 1); alanine (-0.5); histidine (-0.5); cysteine 
(-1.0); methionine (-1.3); valine (-1.5); leucine (-1.8); isoleucine (-1.8); tyrosine (- 

20 2.3); phenylalanine (-2.5); tryptophan (-3.4). It is understood that an amino acid can be 
substituted for another having a similar hydrophilicity value and still obtain a 
biologically equivalent, and in particular, an immunologically equivalent protein. In 
such changes, the substitution of amino acids whose hydrophilicity values are within +2 
is preferred, those within ±1 are particularly preferred, and those within ±0.5 are even 

25 more particularly preferred. 

As outlined above, amino acid substitutions are generally therefore based 
on the relative similarity of the amino acid side-chain substituents, for example, their 
hydrophobicity, hydrophilicity, charge, size, and the like. Exemplary substitutions that 
take various of the foregoing characteristics into consideration are well known to those 

30 of skill in the art and include: arginine and lysine; glutamate and aspartate; serine and 
threonine; glutamine and asparagine; and valine, leucine and isoleucine. 
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In addition, any polynucleotide may be further modified to increase 
stability in vivo. Possible modifications include, but are not limited to, the addition of 
flanking sequences at the 5' and/or 3' ends; the use of phosphorothioate or 2' O-methyl 
rather than phosphodiesterase linkages in the backbone; and/or the inclusion of 
5 nontraditional bases such as inosine, queosine and wybutosine, as well as acetyl- 
methyl-, thio- and other modified forms of adenine, cytidine, guanine, thymine and 
uridine. 

Amino acid substitutions may further be made on the basis of similarity 
in polarity, charge, solubility, hydrophobicity, hydrophilicity and/or the amphipathic 

10 nature of the residues. For example, negatively charged amino acids include aspartic 
acid and glutamic acid; positively charged amino acids include lysine and arginine; and 
amino acids with uncharged polar head groups having similar hydrophilicity values 
include leucine, isoleucine and valine; glycine and alanine; asparagine and glutamine; 
and serine, threonine, phenylalanine and tyrosine. Other groups of amino acids that may 

15 represent conservative changes include: (l)ala, pro, gly, glu, asp, gin, asn, ser, thr; 
(2) cys, ser, tyr, thr; (3) val, ile, leu, met, ala, phe; (4) lys, arg, his; and (5) phe, tyr, trp, 
his. A variant may also, or alternatively, contain non-conservative changes. In a 
preferred embodiment, variant polypeptides differ from a native sequence by 
substitution, deletion or addition of five amino acids or fewer. Variants may also (or 

20 alternatively) be modified by, for example, the deletion or addition of amino acids that 
have minimal influence on the immunogenicity, secondary structure and hydropathic 
nature of the polypeptide. 

As noted above, polypeptides may comprise a signal (or leader) sequence 
at the N-terminal end of the protein, which co-translationally or post-translationally 

25 directs transfer of the protein. The polypeptide may also be conjugated to a linker or 
other sequence for ease of synthesis, purification or identification of the polypeptide 
{e.g., poly-His), or to enhance binding of the polypeptide to a solid support. For 
example, a polypeptide may be conjugated to an immunoglobulin Fc region. 

When comparing polypeptide sequences, two sequences are said to be 

30 "identical" if the sequence of amino acids in the two sequences is the same when 
aligned for maximum correspondence, as described below. Comparisons between two 
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sequences are typically performed by comparing the sequences over a comparison 
window to identify and compare local regions of sequence similarity. A "comparison 
window" as used herein, refers to a segment of at least about 20 contiguous positions, 
usually 30 to about 75, 40 to about 50, in which a sequence may be compared to a 
5 reference sequence of the same number of contiguous positions after the two sequences 
are optimally aligned. 

Optimal alignment of sequences for comparison may be conducted using 
the Megalign program in the Lasergene suite of bioinformatics software (DNASTAR, 
Inc., Madison, WI), using default parameters. This program embodies several 

10 alignment schemes described in the following references: Dayhoff, M.O. (1978) A 
model of evolutionary change in proteins - Matrices for detecting distant relationships. 
In Dayhoff, M.O. (ed.) Atlas of Protein Sequence and Structure, National Biomedical 
Research Foundation, Washington DC Vol. 5, Suppl. 3, pp. 345-358; Hein J. (1990) 
Unified Approach to Alignment and Phylogenes pp. 626-645 Methods in Enzymology 

15 vol. 183, Academic Press, Inc., San Diego, CA; Higgins, D.G. and Sharp, P.M. (1989) 
CABIOS 5:151-153; Myers, E.W. and Muller W. (1988) CABIOS 4:11-17; Robinson, 
E.D. (1971) Comb. Theor 11:105; Santou, N. Nes, M. (1987) Mol. Biol. Evol 4:406- 
425; Sneath, P.H.A. and Sokal, R.R. (1973) Numerical Taxonomy -the Principles and 
Practice of Numerical Taxonomy, Freeman Press, San Francisco, CA; Wilbur, W.J. and 

20 Lipman, D.J. (1983) Proc. Natl. Acad., Sci. USA 80:126-130. 

Alternatively, optimal alignment of sequences for comparison may be 
conducted by the local identity algorithm of Smith and Waterman (1981) Add. APL. 
Math 2:482, by the identity alignment algorithm of Needleman and Wunsch (1970) J. 
Mol. Biol. 48:443, by the search for similarity methods of Pearson and Lipman (1988) 

25 Proc. Natl. Acad. Sci. USA 85: 2444, by computerized implementations of these 
algorithms (GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics 
Software Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, WI), 
or by inspection. 

One preferred example of algorithms that are suitable for determining 
30 percent sequence identity and sequence similarity are the BLAST and BLAST 2.0 
algorithms, which are described in Altschul et al. (1977) Nucl. Acids Res. 25:3389-3402 
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and Altschul et al. (1990) J. Mol. Biol. 215:403-410, respectively. BLAST and BLAST 
2.0 can be used, for example with the parameters described herein, to determine percent 
sequence identity for the polynucleotides and polypeptides of the invention. Software 
for performing BLAST analyses is publicly available through the National Center for 
5 Biotechnology Information. For amino acid sequences, a scoring matrix can be used to 
calculate the cumulative score. Extension of the word hits in each direction are halted 
when: the cumulative alignment score falls off by the quantity X from its maximum 
achieved value; the cumulative score goes to zero or below, due to the accumulation of 
one or more negative-scoring residue alignments; or the end of either sequence is 

10 reached. The BLAST algorithm parameters W, T and X determine the sensitivity and 
speed of the alignment. 

In one preferred approach, the "percentage of sequence identity" is 
determined by comparing two optimally aligned sequences over a window of 
comparison of at least 20 positions, wherein the portion of the polypeptide sequence in 

15 the comparison window may comprise additions or deletions (i.e., gaps) of 20 percent 
or less, usually 5 to 15 percent, or 10 to 12 percent, as compared to the reference 
sequences (which does not comprise additions or deletions) for optimal alignment of the 
two sequences. The percentage is calculated by determining the number of positions at 
which the identical amino acid residue occurs in both sequences to yield the number of 

20 matched positions, dividing the number of matched positions by the total number of 
positions in the reference sequence (i.e., the window size) and multiplying the results by 
100 to yield the percentage of sequence identity. 

Within other illustrative embodiments, a polypeptide may be a fusion 
polypeptide that comprises multiple polypeptides as described herein, or that comprises 

25 at least one polypeptide as described herein and an unrelated sequence, such as a known 
tumor protein. A fusion partner may, for example, assist in providing T helper epitopes 
(an immunological fusion partner), preferably T helper epitopes recognized by humans, 
or may assist in expressing the protein (an expression enhancer) at higher yields than the 
native recombinant protein. Certain preferred fusion partners are both immunological 

30 and expression enhancing fusion partners. Other fusion partners may be selected so as 
to increase the solubility of the polypeptide or to enable the polypeptide to be targeted to 
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desired intracellular compartments. Still further fusion partners include affinity tags, 
which facilitate purification of the polypeptide. 

Fusion polypeptides may generally be prepared using standard 
techniques, including chemical conjugation. Preferably, a fusion polypeptide is 
5 expressed as a recombinant polypeptide, allowing the production of increased levels, 
relative to a non-fused polypeptide, in an expression system. Briefly, DNA sequences 
encoding the polypeptide components may be assembled separately, and ligated into an 
appropriate expression vector. The 3' end of the DNA sequence encoding one 
polypeptide component is ligated, with or without a peptide linker, to the 5' end of a 

10 DNA sequence encoding the second polypeptide component so that the reading frames 
of the sequences are in phase. This permits translation into a single fusion polypeptide 
that retains the biological activity of both component polypeptides. 

A peptide linker sequence may be employed to separate the first and 
second polypeptide components by a distance sufficient to ensure that each polypeptide 

15 folds into its secondary and tertiary structures. Such a peptide linker sequence is 
incorporated into the fusion polypeptide using standard techniques well known in the 
art. Suitable peptide linker sequences may be chosen based on the following factors: 
(1) their ability to adopt a flexible extended conformation; (2) their inability to adopt a 
secondary structure that could interact with functional epitopes on the first and second 

20 polypeptides; and (3) the lack of hydrophobic or charged residues that might react with 
the polypeptide functional epitopes. Preferred peptide linker sequences contain Gly, 
Asn and Ser residues. Other near neutral amino acids, such as Thr and Ala may also be 
used in the linker sequence. Amino acid sequences which may be usefully employed as 
linkers include those disclosed in Maratea et al., Gene 40:39-46, 1985; Murphy et al., 

25 Proc. Natl. Acad. Set USA 55:8258-8262, 1986; U.S. Patent No. 4,935,233 and U.S. 
Patent No. 4,751,180. The linker sequence may generally be from 1 to about 50 amino 
acids in length. Linker sequences are not required when the first and second 
polypeptides have non-essential N-terminal amino acid regions that can be used to 
separate the functional domains and prevent steric interference. 

30 The ligated DNA sequences are operably linked to suitable 

transcriptional or translational regulatory elements. The regulatory elements 
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responsible for expression of DNA are located only 5' to the DNA sequence encoding 
the first polypeptides. Similarly, stop codons required to end translation and 
transcription termination signals are only present 3' to the DNA sequence encoding the 
second polypeptide. 

5 The fusion polypeptide can comprise a polypeptide as described herein 

together with an unrelated immunogenic protein, such as an immunogenic protein 
capable of eliciting a recall response. Examples of such proteins include tetanus, 
tuberculosis and hepatitis proteins (see, for example, Stoute et al. New Engl. J. Med., 
33*86-91, 1997). 

10 In one preferred embodiment, the immunological fusion partner is 

derived from a Mycobacterium sp., such as a Mycobacterium tuberculosis-derived Ral2 
fragment. Ral2 compositions and methods for their use in enhancing the expression 
and/or inimunogenicity of heterologous polynucleotide/polypeptide sequences is 
described in U.S. Patent Application 60/158,585, the disclosure of which is 

1 5 incorporated herein by reference in its entirety. Briefly, Ral 2 refers to a polynucleotide 
region that is a subsequence of a Mycobacterium tuberculosis MTB32A nucleic acid. 
MTB32A is a serine protease of 32 KD molecular weight encoded by a gene in virulent 
and avirulent strains of M. tuberculosis. The nucleotide sequence and amino acid 
sequence of MTB32A have been described (for example, U.S. Patent Application 

20 60/158,585; see also, Skeiky et al, Infection and Immun. (1999) 67:3998-4007, 
incorporated herein by reference). C-terminal fragments of the MTB32A coding 
sequence express at high levels and remain as a soluble polypeptides throughout the 
purification process. Moreover, Ral 2 may enhance the immunogenicity of heterologous 
immunogenic polypeptides with which it is fused. One preferred Ral 2 fusion 

25 polypeptide comprises a 14 KD C-terminal fragment corresponding to amino acid 
residues 192 to 323 of MTB32A. 

Other preferred Ral 2 polynucleotides generally comprise at least about 
15 consecutive nucleotides, at least about 30 nucleotides, at least about 60 nucleotides, 
at least about 100 nucleotides, at least about 200 nucleotides, or at least about 300 

30 nucleotides that encode a portion of a Ral 2 polypeptide. 
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Ral2 polynucleotides may comprise a native sequence (i.e., an 
endogenous sequence that encodes a Ral2 polypeptide or a portion thereof) or may 
comprise a variant of such a sequence. Ral2 polynucleotide variants may contain one 
or more substitutions, additions, deletions and/or insertions such that the biological 
5 activity of the encoded fusion polypeptide is not substantially diminished, relative to a 
fusion polypeptide comprising a native Ral2 polypeptide. Variants preferably exhibit at 
least about 70% identity, more preferably at least about 80% identity and most 
preferably at least about 90% identity to a polynucleotide sequence that encodes a native 
Ral2 polypeptide or a portion thereof. 

10 Within other preferred embodiments, an immunological fusion partner is 

derived from protein D, a surface protein of the gram-negative bacterium Haemophilus 
influenza B (WO 91/18926). Preferably, a protein D derivative comprises 
approximately the first third of the protein (e.g., the first N-terminal 100-110 amino 
acids), and a protein D derivative may be lipidated. Within certain preferred 

15 embodiments, the first 109 residues of a Lipoprotein D fusion partner is included on the 
N-terminus to provide the polypeptide with additional exogenous T-cell epitopes and to 
increase the expression level in E. coli (thus functioning as an expression enhancer). 
The lipid tail ensures optimal presentation of the antigen to antigen presenting cells. 
Other fusion partners include the non-structural protein from influenzae virus, NS1 

20 (hemaglutinin). Typically, the N-terminal 81 amino acids are used, although different 
fragments that include T-helper epitopes may be used. 

In another embodiment, the immunological fusion partner is the protein 
known as LYTA, or a portion thereof (preferably a C-terminal portion). LYTA is 
derived from Streptococcus pneumoniae, which synthesizes an N-acetyl-L-alanine 

25 amidase known as amidase LYTA (encoded by the LytA gene; Gene 43:265-292, 1986). 
LYTA is an autolysin that specifically degrades certain bonds in the peptidoglycan 
backbone. The C-terminal domain of the LYTA protein is responsible for the affinity to 
the choline or to some choline analogues such as DEAE. This property has been 
exploited for the development of E. coli C-LYTA expressing plasmids useful for 

30 expression of fusion proteins. Purification of hybrid proteins containing the C-LYTA 
fragment at the amino terminus has been described (see Biotechnology 70:795-798, 
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1992). Within a preferred embodiment, a repeat portion of LYTA may be incorporated 
into a fusion polypeptide. A repeat portion is found in the C-terminal region starting at 
residue 178. A particularly preferred repeat portion incorporates residues 188-305. 

Yet another illustrative embodiment involves fusion polypeptides, and 
5 the polynucleotides encoding them, wherein the fusion partner comprises a targeting 
signal capable of directing a polypeptide to the endosomal/lysosomal compartment, as 
described in U.S. Patent No. 5,633,234. An hnmunogenic polypeptide of the invention, 
when fused with this targeting signal, will associate more efficiently with MHC class II 
molecules and thereby provide enhanced in vivo stimulation of CD4 + T-cells specific 

1 0 for the polypeptide. 

Polypeptides of the invention are prepared using any of a variety of well 
known synthetic and/or recombinant techniques, the latter of which are further 
described below. Polypeptides, portions and other variants generally less than about 
150 amino acids can be generated by synthetic means, using techniques well known to 

15 those of ordinary skill in the art. In one illustrative example, such polypeptides are 
synthesized using any of the commercially available solid-phase techniques, such as the 
Merrifield solid-phase synthesis method, where amino acids are sequentially added to a 
growing amino acid chain. See Merrifield, J. Am. Chem. Soc. 55:2149-2146, 1963. 
Equipment for automated synthesis of polypeptides is commercially available from 

20 suppliers such as Perkin Elmer/Applied BioSystems Division (Foster City, CA), and 
may be operated according to the manufacturer's instructions. 

In general, polypeptide compositions (including fusion polypeptides) of 
the invention are isolated. An "isolated" polypeptide is one that is removed from its 
original environment. For example, a naturally-occurring protein or polypeptide is 

25 isolated if it is separated from some or all of the coexisting materials in the natural 
system. Preferably, such polypeptides are also purified, e.g., are at least about 90% 
pure, more preferably at least about 95% pure and most preferably at least about 99% 
pure. 

Polynucleotide Compositions 
30 The present invention, in other aspects, provides polynucleotide 

compositions. The terms "DNA" and "polynucleotide" are used essentially 
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interchangeably herein to refer to a DNA molecule that has been isolated free of total 
genomic DNA of a particular species. "Isolated," as used herein, means that a 
polynucleotide is substantially away from other coding sequences, and that the DNA 
molecule does not contain large portions of unrelated coding DNA, such as large 
5 chromosomal fragments or other functional genes or polypeptide coding regions. Of 
course, this refers to the DNA molecule as originally isolated, and does not exclude 
genes or coding regions later added to the segment by the hand of man. 

As will be understood by those skilled in the art, the polynucleotide 
compositions of this invention can include genomic sequences, extra-genomic and 

10 plasmid-encoded sequences and smaller engineered gene segments that express, or may 
be adapted to express, proteins, polypeptides, peptides and the like. Such segments may 
be naturally isolated, or modified synthetically by the hand of man. 

As will be also recognized by the skilled artisan, polynucleotides of the 
invention may be single-stranded (coding or antisense) or double-stranded, and may be 

15 DNA (genomic, cDNA or synthetic) or RNA molecules. RNA molecules may include 
HnRNA molecules, which contain introns and correspond to a DNA molecule in a one- 
to-one manner, and mRNA molecules, which do not contain introns. Additional coding 
or non-coding sequences may, but need not, be present within a polynucleotide of the 
present invention, and a polynucleotide may, but need not, be linked to other molecules 

20 and/or support materials. 

Polynucleotides may comprise a native sequence (i.e., an endogenous 
sequence that encodes a polypeptide/protein of the invention or a portion thereof) or 
may comprise a sequence that encodes a variant or derivative, preferably and 
immunogenic variant or derivative, of such a sequence. 

25 Therefore, according to another aspect of the present invention, 

polynucleotide compositions are provided that comprise some or all of a polynucleotide 
sequence set forth in any one of SEQ ID NO:l-3, 6-8, 10-13, 15-27, 29, 30, 32, 34-49, 
51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80-82, 84, 86-96, 107-109, 111, 113, 
125, 127, 128, 129, 131-133, 142, 144, 148-151, 153, 154, 157, 158, 160, 167, 168, 

30 171, 179, 182, 184-186, 188-191, 193, 194, 198-207, 209, 210, 213, 214, 217, 220-224, 
253-337, 345, 347, 349, 358, 362, 364, 365, 368, 370-375, 420, 424, 428, 431, 434, 
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442, 447, 450 and 467, complements of a polynucleotide sequence set forth in any one 
of SEQ ID NO:l-3, 6-8, 10-13, 15-27, 29, 30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 
71, 73, 74, 77, 78, 80-82, 84, 86-96, 107-109, 111, 113, 125, 127, 128, 129, 131-133, 
142, 144, 148-151, 153, 154, 157, 158, 160, 167, 168, 171, 179, 182, 184-186, 188-191, 
5 193, 194, 198-207, 209, 210, 213, 214, 217, 220-224, 253-337, 345, 347, 349, 358, 362, 

364, 365, 368, 370-375, 420, 424, 428, 431, 434, 442, 447, 450 and 467, and degenerate 
variants of a polynucleotide sequence set forth in any one of SEQ ID NO: 1-3, 6-8, 10- 
13, 15-27, 29, 30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80-82, 84, 
86-96, 107-109, 111, 113, 125, 127, 128, 129, 131-133, 142, 144, 148-151, 153, 154, 

10 157, 158, 160, 167, 168, 171, 179, 182, 184-186, 188-191, 193, 194, 198-207, 209, 210, 
213, 214, 217, 220-224, 253-337, 345, 347, 349, 358, 362, 364, 365, 368, 370-375, 420, 
424, 428, 431, 434, 442, 447, 450 and 467. In certain preferred embodiments, the 
polynucleotide sequences set forth herein encode immunogenic polypeptides, as 
described above. 

15 In other related embodiments, the present invention provides 

polynucleotide variants having substantial identity to the sequences disclosed herein in 
SEQ IDNO:l-3, 6-8, 10-13, 15-27, 29, 30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 
73, 74, 77, 78, 80-82, 84, 86-96, 107-109, 111, 113, 125, 127, 128, 129, 131-133, 142, 
144, 148-151, 153, 154, 157, 158, 160, 167, 168, 171, 179, 182, 184-186, 188-191, 193, 

20 194, 198-207, 209, 210, 213, 214, 217, 220-224, 253-337, 345, 347, 349, 358, 362, 364, 

365, 368, 370-375, 420, 424, 428, 431, 434, 442, 447, 450 and 467, for example those 
comprising at least 70% sequence identity, preferably at least 75%, 80%, 85%, 90%, 
95%, 96%, 97%, 98%, or 99% or higher, sequence identity compared to a 
polynucleotide sequence of this invention using the methods described herein, (e.g., 

25 BLAST analysis using standard parameters, as described below). One skilled in this art 
will recognize that these values can be appropriately adjusted to determine 
corresponding identity of proteins encoded by two nucleotide sequences by taking into 
account codon degeneracy, amino acid similarity, reading frame positioning and the 
like. 

30 Typically, polynucleotide variants will contain one or more substitutions, 

additions, deletions and/or insertions, preferably such that the immunogenicity of the 
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polypeptide encoded by the variant polynucleotide is not substantially diminished 
relative to a polypeptide encoded by a polynucleotide sequence specifically set forth 
herein). The term "variants" should also be understood to encompasses homologous 
genes of xenogenic origin. 
5 In additional embodiments, the present invention provides 

polynucleotide fragments comprising various lengths of contiguous stretches of 
sequence identical to or complementary to one or more of the sequences disclosed 
herein. For example, polynucleotides are provided by this invention that comprise at 
least about 10, 15, 20, 30, 40, 50, 75, 100, 150, 200, 300, 400, 500 or 1000 or more 

10 contiguous nucleotides of one or more of the sequences disclosed herein as well as all 
intermediate lengths there between. It will be readily understood that "intermediate 
lengths", in this context, means any length between the quoted values, such as 16, 17, 
18, 19, etc.; 21, 22, 23, etc.; 30, 31, 32, etc.; 50, 51, 52, 53, etc.; 100, 101, 102, 103, 
etc.; 150, 151, 152, 153, etc. ; including all integers through 200-500; 500-1,000, and the 

15 like. 

In another embodiment of the invention, polynucleotide compositions are 
provided that are capable of hybridizing under moderate to high stringency conditions to 
a polynucleotide sequence provided herein, or a fragment thereof, or a complementary 
sequence thereof. Hybridization techniques are well known in the art of molecular 

20 biology. For purposes of illustration, suitable moderately stringent conditions for 
testing the hybridization of a polynucleotide of this invention with other polynucleotides 
include prewashing in a solution of 5 X SSC, 0.5% SDS, 1.0 mM EDTA (pH 8.0); 
hybridizing at 50°C-60°C, 5 X SSC, overnight; followed by washing twice at 65°C for 
20 minutes with each of 2X, 0.5X and 0.2X SSC containing 0.1% SDS. One skilled in 

25 the art will understand that the stringency of hybridization can be readily manipulated, 
such as by altering the salt content of the hybridization solution and/or the temperature 
at which the hybridization is performed. For example, in another embodiment, suitable 
highly stringent hybridization conditions include those described above, with the 
exception that the temperature of hybridization is increased, e.g., to 60-65°C or 65- 

30 70°C. 
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In certain preferred embodiments, the polynucleotides described above, 
e.g., polynucleotide variants, fragments and hybridizing sequences, encode polypeptides 
that are immunologically cross-reactive with a polypeptide sequence specifically set 
forth herein. In other preferred embodiments, such polynucleotides encode 
5 polypeptides that have a level of immunogenic activity of at least about 50%, preferably 
at least about 70%, and more preferably at least about 90% of that for a polypeptide 
sequence specifically set forth herein. 

The polynucleotides of the present invention, or fragments thereof, 
regardless of the length of the coding sequence itself, may be combined with other DNA 

10 sequences, such as promoters, polyadenylation signals, additional restriction enzyme 
sites, multiple cloning sites, other coding segments, and the like, such that their overall 
length may vary considerably. It is therefore contemplated that a nucleic acid fragment 
of almost any length may be employed, with the total length preferably being limited by 
the ease of preparation and use in the intended recombinant DNA protocol. For 

15 example, illustrative polynucleotide segments with total lengths of about 10,000, about 
5000, about 3000, about 2,000, about 1,000, about 500, about 200, about 100, about 50 
base pairs in length, and the like, (including all intermediate lengths) are contemplated 
to be useful in many implementations of this invention. 

When comparing polynucleotide sequences, two sequences are said to be 

20 "identical" if the sequence of nucleotides in the two sequences is the same when aligned 
for maximum correspondence, as described below. Comparisons between two 
sequences are typically performed by comparing the sequences over a comparison 
window to identify and compare local regions of sequence similarity. A "comparison 
window" as used herein, refers to a segment of at least about 20 contiguous positions, 

25 usually 30 to about 75, 40 to about 50, in which a sequence may be compared to a 
reference sequence of the same number of contiguous positions after the two sequences 
are optimally aligned. 

Optimal alignment of sequences for comparison may be conducted using 
the Megalign program in the Lasergene suite of bioinformatics software (DNASTAR, 

30 Inc., Madison, WI), using default parameters. This program embodies several 
alignment schemes described in the following references: Dayhoff, M.O. (1978) A 
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model of evolutionary change in proteins - Matrices for detecting distant relationships. 
In Dayhoff, M.O. (ed.) Atlas of Protein Sequence and Structure, National Biomedical 
Research Foundation, Washington DC Vol. 5, Suppl. 3, pp. 345-358; Hein J. (1990) 
Unified Approach to Alignment and Phylogenes pp. 626-645 Methods in Enzymology 
5 vol. 183, Academic Press, Inc., San Diego, CA; Higgins, D.G. and Sharp, P.M. (1989) 
CABIOS 5:151-153; Myers, E.W. and Midler W. (1988) CABIOS 4:11-17; Robinson, 
E.D. (1971) Comb. Theor 11:105; Santou, N. Nes, M. (1987) Mol. Biol. Evol. 4:406- 
425; Sneath, P.H.A. and Sokal, R.R. (1973) Numerical Taxonomy — the Principles and 
Practice of Numerical Taxonomy, Freeman Press, San Francisco, CA; Wilbur, W.J. and 

1 0 Lipman, D.J. (1983) Proc. Natl. Acad., Sci. USA 80:726-730. 

Alternatively, optimal alignment of sequences for comparison may be 
conducted by the local identity algorithm of Smith and Waterman (1981) Add. APL. 
Math 2:482, by the identity alignment algorithm of Needleman and Wunsch (1970) J. 
Mol. Biol. 48:443, by the search for similarity methods of Pearson and Lipman (1988) 

15 Proc. Natl. Acad. Sci. USA 85: 2444, by computerized implementations of these 
algorithms (GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics 
Software Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, WI), 
or by inspection. 

One preferred example of algorithms that are suitable for determining 
20 percent sequence identity and sequence similarity are the BLAST and BLAST 2.0 
algorithms, which are described in Altschul et al. (1977) Nucl. Acids Res. 25:3389-3402 
and Altschul et al. (1990) J. Mol. Biol. 215:403-410, respectively. BLAST and BLAST 
2.0 can be used, for example with the parameters described herein, to determine percent 
sequence identity for the polynucleotides of the invention. Software for performing 
25 BLAST analyses is publicly available through the National Center for Biotechnology 
Information. In one illustrative example, cumulative scores can be calculated using, for 
nucleotide sequences, the parameters M (reward score for a pair of matching residues; 
always >0) and N (penalty score for mismatching residues; always <0). Extension of 
the word hits in each direction are halted when: the cumulative alignment score falls off 
30 by the quantity X from its maximum achieved value; the cumulative score goes to zero 
or below, due to the accumulation of one or more negative-scoring residue alignments; 
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or the end of either sequence is reached. The BLAST algorithm parameters W, T and X 
determine the sensitivity and speed of the alignment. The BLASTN program (for 
nucleotide sequences) uses as defaults a wordlength (W) of 11, and expectation (E) of 
10, and the BLOSUM62 scoring matrix (see Henikoff and Henikoff (1989) Proc. Natl. 
5 Acad. Sci. USA 89:10915) alignments, (B) of 50, expectation (E) of 10, M=5, N=-4 and 
a comparison of both strands. 

Preferably, the "percentage of sequence identity" is determined by 
comparing two optimally aligned sequences over a window of comparison of at least 20 
positions, wherein the portion of the polynucleotide sequence in the comparison 

10 window may comprise additions or deletions (i.e., gaps) of 20 percent or less, usually 5 
to 15 percent, or 10 to 12 percent, as compared to the reference sequences (which does 
not comprise additions or deletions) for optimal alignment of the two sequences. The 
percentage is calculated by determining the number of positions at which the identical 
nucleic acid bases occurs in both sequences to yield the number of matched positions, 

15 dividing the number of matched positions by the total number of positions in the 
reference sequence (i.e., the window size) and multiplying the results by 100 to yield the 
percentage of sequence identity. 

It will be appreciated by those of ordinary skill in the art that, as a result 
of the degeneracy of the genetic code, there are many nucleotide sequences that encode 

20 a polypeptide as described herein. Some of these polynucleotides bear minimal 
homology to the nucleotide sequence of any native gene. Nonetheless, polynucleotides 
that vary due to differences in codon usage are specifically contemplated by the present 
invention. Further, alleles of the genes comprising the polynucleotide sequences 
provided herein are within the scope of the present invention. Alleles are endogenous 

25 genes that are altered as a result of one or more mutations, such as deletions, additions 
and/or substitutions of nucleotides. The resulting mRNA and protein may, but need not, 
have an altered structure or function. Alleles may be identified using standard 
techniques (such as hybridization, amplification and/or database sequence comparison). 

Therefore, in another embodiment of the invention, a mutagenesis 

30 approach, such as site-specific mutagenesis, is employed for the preparation of 
immunogenic variants and/or derivatives of the polypeptides described herein. By this 
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approach, specific modifications in a polypeptide sequence can be made through 
mutagenesis of the underlying polynucleotides that encode them. These techniques 
provides a straightforward approach to prepare and test sequence variants, for example, 
incorporating one or more of the foregoing considerations, by introducing one or more 
5 nucleotide sequence changes into the polynucleotide. 

Site-specific mutagenesis allows the production of mutants through the 
use of specific oligonucleotide sequences which encode the DNA sequence of the 
desired mutation, as well as a sufficient number of adjacent nucleotides, to provide a 
primer sequence of sufficient size and sequence complexity to form a stable duplex on 

10 both sides of the deletion junction being traversed. Mutations may be employed in a 
selected polynucleotide sequence to improve, alter, decrease, modify, or otherwise 
change the properties of the polynucleotide itself, and/or alter the properties, activity, 
composition, stability, or primary sequence of the encoded polypeptide. 

In certain embodiments of the present invention, the inventors 

15 contemplate the mutagenesis of the disclosed polynucleotide sequences to alter one or 
more properties of the encoded polypeptide, such as the immunogenicity of a 
polypeptide vaccine. The techniques of site-specific mutagenesis are well-known in the 
art, and are widely used to create variants of both polypeptides and polynucleotides. For 
example, site-specific mutagenesis is often used to alter a specific portion of a DNA 

20 molecule. In such embodiments, a primer comprising typically about 14 to about 25 
nucleotides or so in length is employed, with about 5 to about 10 residues on both sides 
of the junction of the sequence being altered. 

As will be appreciated by those of skill in the art, site-specific 
mutagenesis techniques have often employed a phage vector that exists in both a single 

25 stranded and double stranded form. Typical vectors useful in site-directed mutagenesis 
include vectors such as the Ml 3 phage. These phage are readily commercially-available 
and their use is generally well-known to those skilled in the art. Double-stranded 
plasmids are also routinely employed in site directed mutagenesis that eliminates the 
step of transferring the gene of interest from a plasmid to a phage. 

30 In general, site-directed mutagenesis in accordance herewith is 

performed by first obtaining a single-stranded vector or melting apart of two strands of a 
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double-stranded vector that includes within its sequence a DNA sequence that encodes 
the desired peptide. An oligonucleotide primer bearing the desired mutated sequence is 
prepared, generally synthetically. This primer is then annealed with the single-stranded 
vector, and subjected to DNA polymerizing enzymes such as E. coli polymerase I 
5 Klenow fragment, in order to complete the synthesis of the mutation-bearing strand. 
Thus, a heteroduplex is formed wherein one strand encodes the original non-mutated 
sequence and the second strand bears the desired mutation. This heteroduplex vector is 
then used to transform appropriate cells, such as E. coli cells, and clones are selected 
which include recombinant vectors bearing the mutated sequence arrangement. 

10 The preparation of sequence variants of the selected peptide-encoding 

DNA segments using site-directed mutagenesis provides a means of producing 
potentially useful species and is not meant to be limiting as there are other ways in 
which sequence variants of peptides and the DNA sequences encoding them may be 
obtained. For example, recombinant vectors encoding the desired peptide sequence 

15 may be treated with mutagenic agents, such as hydroxylamine, to obtain sequence 
variants. Specific details regarding these methods and protocols are found in the 
teachings of Maloy et a!., 1994; Segal, 1976; Prokop and Bajpai, 1991; Kuby, 1994; and 
Maniatis et al. , 1982, each incorporated herein by reference, for that purpose. 

As used herein, the term "oligonucleotide directed mutagenesis 

20 procedure" refers to template-dependent processes and vector-mediated propagation 
which result in an increase in the concentration of a specific nucleic acid molecule 
relative to its initial concentration, or in an increase in the concentration of a detectable 
signal, such as amplification. As used herein, the term "oligonucleotide directed 
mutagenesis procedure" is intended to refer to a process that involves the 

25 template-dependent extension of a primer molecule. The term template dependent 
process refers to nucleic acid synthesis of an RNA or a DNA molecule wherein the 
sequence of the newly synthesized strand of nucleic acid is dictated by the well-known 
rules of complementary base pairing (see, for example, Watson, 1987). Typically, 
vector mediated methodologies involve the introduction of the nucleic acid fragment 

30 into a DNA or RNA vector, the clonal amplification of the vector, and the recovery of 
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the amplified nucleic acid fragment. Examples of such methodologies are provided by 
U. S. Patent No. 4,237,224, specifically incorporated herein by reference in its entirety. 

In another approach for the production of polypeptide variants of the 
present invention, recursive sequence recombination, as described in U.S. Patent No. 
5 5,837,458, may be employed. In this approach, iterative cycles of recombination and 
screening or selection are performed to "evolve" individual polynucleotide variants of 
the invention having, for example, enhanced immunogenic activity. 

In other embodiments of the present invention, the polynucleotide 
sequences provided herein can be advantageously used as probes or primers for nucleic 

1 0 acid hybridization. As such, it is contemplated that nucleic acid segments that comprise 
a sequence region of at least about 15 nucleotide long contiguous sequence that has the 
same sequence as, or is complementary to, a 15 nucleotide long contiguous sequence 
disclosed herein will find particular utility. Longer contiguous identical or 
complementary sequences, e.g., those of about 20, 30, 40, 50, 100, 200, 500, 1000 

15 (including all intermediate lengths) and even up to full length sequences will also be of 
use in certain embodiments. 

The ability of such nucleic acid probes to specifically hybridize to a 
sequence of interest will enable them to be of use in detecting the presence of 
complementary sequences in a given sample. However, other uses are also envisioned, 

20 such as the use of the sequence mformation for the preparation of mutant species 
primers, or primers for use in preparing other genetic constructions. 

Polynucleotide molecules having sequence regions consisting of 
contiguous nucleotide stretches of 10-14, 15-20, 30, 50, or even of 100-200 nucleotides 
or so (including intermediate lengths as well), identical or complementary to a 

25 polynucleotide sequence disclosed herein, are particularly contemplated as hybridization 
probes for use in, e.g., Southern and Northern blotting. This would allow a gene 
product, or fragment thereof, to be analyzed, both in diverse cell types and also in 
various bacterial cells. The total size of fragment, as well as the size of the 
complementary stretch(es), will ultimately depend on the intended use or application of 

30 the particular nucleic acid segment. Smaller fragments will generally find use in 
hybridization embodiments, wherein the length of the contiguous complementary region 
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may be varied, such as between about 15 and about 100 nucleotides, but larger 
contiguous complementarity stretches may be used, according to the length 
complementary sequences one wishes to detect. 

The use of a hybridization probe of about 15-25 nucleotides in length 
5 allows the formation of a duplex molecule that is both stable and selective. Molecules 
having contiguous complementary sequences over stretches greater than 15 bases in 
length are generally preferred, though, in order to increase stability and selectivity of the 
hybrid, and thereby improve the quality and degree of specific hybrid molecules 
obtained. One will generally prefer to design nucleic acid molecules having gene- 
10 complementary stretches of 15 to 25 contiguous nucleotides, or even longer where 
desired. 

Hybridization probes may be selected from any portion of any of the 
sequences disclosed herein. All that is required is to review the sequences set forth 
herein, or to any continuous portion of the sequences, from about 15-25 nucleotides in 
15 length up to and including the full length sequence, that one wishes to utilize as a probe 
or primer. The choice of probe and primer sequences may be governed by various 
factors. For example, one may wish to employ primers from towards the termini of the 
total sequence. 

Small polynucleotide segments or fragments may be readily prepared by, 
20 for example, directly synthesizing the fragment by chemical means, as is commonly 
practiced using an automated oligonucleotide synthesizer. Also, fragments may be 
obtained by application of nucleic acid reproduction technology, such as the PCR™ 
technology of U. S. Patent 4,683,202 (incorporated herein by reference), by introducing 
selected sequences into recombinant vectors for recombinant production, and by other 
25 recombinant DNA techniques generally known to those of skill in the art of molecular 
biology. 

The nucleotide sequences of the invention may be used for their ability to 
selectively form duplex molecules with complementary stretches of the entire gene or 
gene fragments of interest. Depending on the application envisioned, one will typically 
30 desire to employ varying conditions of hybridization to achieve varying degrees of 
selectivity of probe towards target sequence. For applications requiring high selectivity, 
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one will typically desire to employ relatively stringent conditions to form the hybrids, 
e.g., one will select relatively low salt and/or high temperature conditions, such as 
provided by a salt concentration of from about 0.02 M to about 0.15 M salt at 
temperatures of from about 50°C to about 70°C. Such selective conditions tolerate 
5 little, if any, mismatch between the probe and the template or target strand, and would 
be particularly suitable for isolating related sequences. 

Of course, for some applications, for example, where one desires to 
prepare mutants employing a mutant primer strand hybridized to an underlying 
template, less stringent (reduced stringency) hybridization conditions will typically be 

10 needed in order to allow formation of the heteroduplex. In these circumstances, one 
may desire to employ salt conditions such as those of from about 0.15 M to about 0.9 M 
salt, at temperatures ranging from about 20°C to about 55°C. Cross-hybridizing species 
can thereby be readily identified as positively hybridizing signals with respect to control 
hybridizations. In any case, it is generally appreciated that conditions can be rendered 

1 5 more stringent by the addition of increasing amounts of formamide, which serves to 
destabilize the hybrid duplex in the same manner as increased temperature. Thus, 
hybridization conditions can be readily manipulated, and thus will generally be a 
method of choice depending on the desired results. 

According to another embodiment of the present invention, 

20 polynucleotide compositions comprising antisense oligonucleotides are provided. 
Antisense oligonucleotides have been demonstrated to be effective and targeted 
inhibitors of protein synthesis, and, consequently, provide a therapeutic approach by 
which a disease can be treated by inhibiting the synthesis of proteins that contribute to 
the disease. The efficacy of antisense oligonucleotides for inhibiting protein synthesis 

25 is well established. For example, the synthesis of polygalactauronase and the muscarine 
type 2 acetylcholine receptor are inhibited by antisense oligonucleotides directed to their 
respective mRNA sequences (U. S. Patent 5,739,119 and U. S. Patent 5,759,829). 
Further, examples of antisense inhibition have been demonstrated with the nuclear 
protein cyclin, the multiple drug resistance gene (MDG1), ICAM-1, E-selectin, STK-1, 

30 striatal GABA A receptor and human EGF (Jaskulski et al, Science. 1988 Jun 
10;240(4858):1544-6; Vasanthakumar and Ahmed, Cancer Commun. 1989;1(4):225- 
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32; Peris et ah, Brain Res Mol Brain Res. 1998 Jim 15;57(2):3 10-20; U. S. Patent 
5,801,154; U.S. Patent 5,789,573; U. S. Patent 5,718,709 and U.S. Patent 5,610,288). 
Antisense constructs have also been described that inhibit and can be used to treat a 
variety of abnormal cellular proliferations, e.g. cancer (U. S. Patent 5,747,470; U. S. 
5 Patent 5,591,317 and U. S. Patent 5,783,683). 

Therefore, in certain embodiments, the present invention provides 
oligonucleotide sequences that comprise all, or a portion of, any sequence that is 
capable of specifically binding to polynucleotide sequence described herein, or a 
complement thereof. In one embodiment, the antisense oligonucleotides comprise DNA 

10 or derivatives thereof. In another embodiment, the oligonucleotides comprise RNA or 
derivatives thereof. In a triird embodiment, the oligonucleotides are modified DNAs 
comprising a phosphorothioated modified backbone. In a fourth embodiment, the 
oligonucleotide sequences comprise peptide nucleic acids or derivatives thereof. In 
each case, preferred compositions comprise a sequence region that is complementary, 

15 and more preferably substantially-complementary, and even more preferably, 
completely complementary to one or more portions of polynucleotides disclosed herein. 
Selection of antisense compositions specific for a given gene sequence is based upon 
analysis of the chosen target sequence and determination of secondary structure, T m , 
binding energy, and relative stability. Antisense compositions may be selected based 

20 upon their relative inability to form dimers, hairpins, or other secondary structures that 
would reduce or prohibit specific binding to the target mRNA in a host cell. Highly 
preferred target regions of the mRNA, are those which are at or near the AUG 
translation initiation codon, and those sequences which are substantially complementary 
to 5' regions of the mRNA. These secondary structure analyses and target site selection 

25 considerations can be performed, for example, using v.4 of the OLIGO primer analysis 
software and/or the BLASTN 2.0.5 algorithm software (Altschul et ah, Nucleic Acids 
Res. 1997, 25(17):3389-402). 

The use of an antisense delivery method employing a short peptide 
vector, termed MPG (27 residues), is also contemplated. The MPG peptide contains a 

30 hydrophobic domain derived from the fusion sequence of HIV gp41 and a hydrophilic 
domain from the nuclear localization sequence of SV40 T-antigen (Morris et ah, 
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Nucleic Acids Res. 1997 Jul 15;25(14):2730-6). It has been demonstrated that several 
molecules of the MPG peptide coat the antisense oligonucleotides and can be delivered 
into cultured mammalian cells in less than 1 hour with relatively high efficiency (90%). 
Further, the interaction with MPG strongly increases both the stability of the 
5 oligonucleotide to nuclease and the ability to cross the plasma membrane. 

According to another embodiment of the invention, the polynucleotide 
compositions described herein are used in the design and preparation of ribozyme 
molecules for inhibiting expression of the tumor polypeptides and proteins of the 
present invention in tumor cells. Ribo2ymes are RNA-protein complexes that cleave 

10 nucleic acids in a site-specific fashion. Ribozymes have specific catalytic domains that 
possess endonuclease activity (Kim and Cech, Proc Natl Acad Sci USA. 1987 
Dec;84(24):8788-92; Forster and Symons, Cell. 1987 Apr 24;49(2):2 11-20). For 
example, a large number of ribozymes accelerate phosphoester transfer reactions with a 
high degree of specificity, often cleaving only one of several phosphoesters in an 

15 oligonucleotide substrate (Cech et al, Cell. 1981 Dec;27(3 Pt 2):487-96; Michel and 
Westhof, J Mol Biol. 1990 Dec 5;216(3):585-610; Reinhold-Hurek and Shub, Nature. 
1992 May 14;357(6374): 173-6). This specificity has been attributed to the requirement 
that the substrate bind via specific base-pairing interactions to the internal guide 
sequence ("IGS") of the ribozyme prior to chemical reaction. 

20 Six basic varieties of naturally-occurring enzymatic RNAs are known 

presently. Each can catalyze the hydrolysis of RNA phosphodiester bonds in trans (and 
thus can cleave other RNA molecules) under physiological conditions. In general, 
enzymatic nucleic acids act by first binding to a target RNA. Such binding occurs 
through the target binding portion of a enzymatic nucleic acid which is held in close 

25 proximity to an enzymatic portion of the molecule that acts to cleave the target RNA. 
Thus, the enzymatic nucleic acid first recognizes and then binds a target RNA through 
complementary base-pairing, and once bound to the correct site, acts enzymatically to 
cut the target RNA. Strategic cleavage of such a target RNA will destroy its ability to 
direct synthesis of an encoded protein. After an enzymatic nucleic acid has bound and 

3 0 cleaved its RNA target, it is released from that RNA to search for another target and can 
repeatedly bind and cleave new targets. 
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The enzymatic nature of a ribozyme is advantageous over many 
technologies, such as antisense technology (where a nucleic acid molecule simply binds 
to a nucleic acid target to block its translation) since the concentration of ribozyme 
necessary to affect a therapeutic treatment is lower than that of an antisense 
5 oligonucleotide. This advantage reflects the ability of the ribozyme to act 
enzymatically. Thus, a single ribozyme molecule is able to cleave many molecules of 
target RNA. In addition, the ribozyme is a highly specific inhibitor, with the specificity 
of inhibition depending not only on the base pairing mechanism of binding to the target 
RNA, but also on the mechanism of target RNA cleavage. Single mismatches, or base- 

10 substitutions, near the site of cleavage can completely eliminate catalytic activity of a 
ribozyme. Similar mismatches in antisense molecules do not prevent their action 
(Woolf etal, Proc Natl Acad Sci USA. 1992 Aug 15;89(16):7305-9). Thus, the 
specificity of action of a ribozyme is greater than that of an antisense oligonucleotide 
binding the same RNA site. 

15 The enzymatic nucleic acid molecule may be formed in a hammerhead, 

hairpin, a hepatitis 5 virus, group I intron or RNaseP RNA (in association with an RNA 
guide sequence) or Neurospora VS RNA motif. Examples of hammerhead motifs are 
described by Rossi etal. Nucleic Acids Res. 1992 Sep ll;20(17):4559-65. Examples of 
hairpin motifs are described by Hampel etal. (Eur. Pat. Appl. Publ. No. EP 0360257), 

20 Hampel and Tritz, Biochemistry 1989 Jun 13;28(12):4929-33; Hampel etal, Nucleic 
Acids Res. 1990 Jan 25;18(2):299-304 and U. S. Patent 5,631,359. An example of the 
hepatitis 8 virus motif is described by Perrotta and Been, Biochemistry. 1992 Dec 
1;31(47):1 1843-52; an example of the RNaseP motif is described by Guerrier-Takada 
etal, Cell. 1983 Dec;35(3 Pt 2):849-57; Neurospora VS RNA ribozyme motif is 

25 described by Collins (Saville and Collins, Cell. 1990 May 18;61(4):685-96; Saville and 
Collins, Proc Natl Acad Sci USA. 1991 Oct l;88(19):8826-30; Collins and Olive, 
Biochemistry. 1993 Mar 23;32(1 1):2795-9); and an example of the Group I intron is 
described in (U. S. Patent 4,987,071). All that is important in an enzymatic nucleic acid 
molecule of this invention is that it has a specific substrate binding site which is 

30 complementary to one or more of the target gene RNA regions, and that it have 
nucleotide sequences within or surrounding that substrate binding site which impart an 
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RNA cleaving activity to the molecule. Thus the ribozyme constructs need not be 
limited to specific motifs mentioned herein. 

Ribozymes may be designed as described in Int. Pat. Appl. Publ. No. 
WO 93/23569 and Int. Pat. Appl. Publ. No. WO 94/02595, each specifically 
5 incorporated herein by reference) and synthesized to be tested in vitro and in vivo, as 
described. Such ribozymes can also be optimized for delivery. While specific 
examples are provided, those in the art will recognize that equivalent RNA targets in 
other species can be utilized when necessary. 

Ribozyme activity can be optimized by altering the length of the 

10 ribozyme binding arms, or chemically synthesizing ribozymes with modifications that 
prevent their degradation by serum ribonucleases (see e.g., Int. Pat. Appl. Publ. No. WO 
92/07065; Int. Pat. Appl. Publ. No. WO 93/15187; Int. Pat. Appl. Publ. No. WO 
91/03162; Eur. Pat. Appl. Publ. No. 92110298.4; U. S. Patent 5,334,711; and Int. Pat. 
Appl. Publ. No. WO 94/13688, which describe various chemical modifications that can 

15 be made to the sugar moieties of enzymatic RNA molecules), modifications which 
enhance their efficacy in cells, and removal of stem II bases to shorten RNA synthesis 
times and reduce chemical requirements. 

Sullivan et al. (Int. Pat. Appl. Publ. No. WO 94/02595) describes the 
general methods for delivery of enzymatic RNA molecules. Ribozymes may be 

20 administered to cells by a variety of methods known to those familiar to the art, 
including, but not restricted to, encapsulation in liposomes, by iontophoresis, or by 
incorporation into other vehicles, such as hydrogels, cyclodextrins, biodegradable 
nanocapsules, and bioadhesive microspheres. For some indications, ribozymes may be 
directly delivered ex vivo to cells or tissues with or without the aforementioned vehicles. 

25 Alternatively, the RNA/vehicle combination may be locally delivered by direct 
inhalation, by direct injection or by use of a catheter, infusion pump or stint. Other 
routes of delivery include, but are not limited to, intravascular, intramuscular, 
subcutaneous or joint injection, aerosol inhalation, oral (tablet or pill form), topical, 
systemic, ocular, intraperitoneal and/or intrathecal delivery. More detailed descriptions 

30 of ribozyme delivery and administration are provided in Int. Pat. Appl. Publ. No. WO 
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94/02595 and Int. Pat. Appl. Publ. No. WO 93/23569, each specifically incorporated 
herein by reference. 

Another means of accumulating high concentrations of a ribozyme(s) 
within cells is to incorporate the ribozyme-encoding sequences into a DNA expression 
5 vector. Transcription of the ribozyme sequences are driven from a promoter for 
eukaryotic RNA polymerase I (pol I), RNA polymerase II (pol II), or RNA polymerase 
III (pol III). Transcripts from pol II or pol III promoters will be expressed at high levels 
in all cells; the levels of a given pol II promoter in a given cell type will depend on the 
nature of the gene regulatory sequences (enhancers, silencers, etc.) present nearby. 

10 Prokaryotic RNA polymerase promoters may also be used, providing that the 
prokaryotic RNA polymerase enzyme is expressed in the appropriate cells. Ribozymes 
expressed from such promoters have been shown to function in mammalian cells. Such 
transcription units can be incorporated into a variety of vectors for introduction into 
mammalian cells, including but not restricted to, plasmid DNA vectors, viral DNA 

1 5 vectors (such as adenovirus or adeno-associated vectors), or viral RNA vectors (such as 
retroviral, semliki forest virus, sindbis virus vectors). 

In another embodiment of the invention, peptide nucleic acids (PNAs) 
compositions are provided. PNA is a DNA mimic in which the nucleobases are 
attached to a pseudopeptide backbone (Good and Nielsen, Antisense Nucleic Acid Drug 

20 Dev. 1997 7(4) 431-37). PNA is able to be utilized in a number methods that 
traditionally have used RNA or DNA. Often PNA sequences perform better in 
techniques than the corresponding RNA or DNA sequences and have utilities that are 
not inherent to RNA or DNA. A review of PNA including methods of making, 
characteristics of, and methods of using, is provided by Corey (Trends Biotechnol 1997 

25 Jun;15(6):224-9). As such, in certain embodiments, one may prepare PNA sequences 
that are complementary to one or more portions of the ACE mRNA sequence, and such 
PNA compositions may be used to regulate, alter, decrease, or reduce the translation of 
ACE-specific mRNA, and thereby alter the level of ACE activity in a host cell to which 
such PNA compositions have been administered. 

30 PNAs have 2-aminoethyl-glycine linkages replacing the normal 

phosphodiester backbone of DNA (Nielsen et ah, Science 1991 Dec 6;254(5037):1497- 
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500; Hanvey et al, Science. 1992 Nov 27;258(5087):1481-5; Hyrup and Nielsen, 
Bioorg Med Chem. 1996 Jan;4(l):5-23). This chemistry has three important 
consequences: firstly, in contrast to DNA or phosphorothioate oligonucleotides, PNAs 
are neutral molecules; secondly, PNAs are achiral, which avoids the need to develop a 
5 stereoselective synthesis; and thirdly, PNA synthesis uses standard Boc or Fmoc 
protocols for solid-phase peptide synthesis, although other methods, including a 
modified Merrifield method, have been used. 

PNA monomers or ready-made oligomers are commercially available 
from PerSeptive Biosystems (Framingham, MA). PNA syntheses by either Boc or 

1 0 Fmoc protocols are straightforward using manual or automated protocols (Norton et al. , 
Bioorg Med Chem. 1995 Apr;3(4):437-45). The manual protocol lends itself to the 
production of chemically modified PNAs or the simultaneous synthesis of families of 
closely related PNAs. 

As with peptide synthesis, the success of a particular PNA synthesis will 

15 depend on the properties of the chosen sequence. For example, while in theory PNAs 
can incorporate any combination of nucleotide bases, the presence of adjacent purines 
can lead to deletions of one or more residues in the product. In expectation of this 
difficulty, it is suggested that, in producing PNAs with adjacent purines, one should 
repeat the coupling of residues likely to be added inefficiently. This should be followed 

20 by the purification of PNAs by reverse-phase high-pressure liquid chromatography, 
providing yields and purity of product similar to those observed during the synthesis of 
peptides. 

Modifications of PNAs for a given application may be accomplished by 
coupling amino acids during solid-phase synthesis or by attaching compounds that 

25 contain a carboxylic acid group to the exposed N-terminal amine. Alternatively, PNAs 
can be modified after synthesis by coupling to an introduced lysine or cysteine. The 
ease with which PNAs can be modified facilitates optimization for better solubility or 
for specific functional requirements. Once synthesized, the identity of PNAs and their 
derivatives can be confirmed by mass spectrometry. Several studies have made and 

30 utilized modifications of PNAs (for example, Norton et al, Bioorg Med Chem. 1995 
Apr;3(4):437-45; Petersen et al, J Pept Sci. 1995 May-Jun;l(3):175-83; Oram et al, 
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Biotechniques. 1995 Sep;19(3):472-80; Footer et al, Biochemistiy. 1996 Aug 
20;35(33):10673-9; Griffith et al, Nucleic Acids Res. 1995 Aug ll;23(15):3003-8; 
Pardridge et al, Proc Natl Acad Sci USA. 1995 Jun 6;92(12):5592-6; Boffa et al, 
Proc Natl Acad Sci USA. 1995 Mar 14;92(6):1901-5; Gambacorti-Passerini et al, 
5 Blood. 1996 Aug 15;88(4):1411-7; Armitage et al, Proc Natl Acad Sci USA. 1997 
Nov 11;94(23): 12320-5; Seeger et al, Biotechniques. 1997 Sep;23(3):512-7). U.S. 
Patent No. 5,700,922 discusses PNA-DNA-PNA chimeric molecules and their uses in 
diagnostics, modulating protein in organisms, and treatment of conditions susceptible to 
therapeutics. 

10 Methods of characterizing the antisense binding properties of PNAs are 

discussed in Rose (Anal Chem. 1993 Dec 15;65(24):3545-9) and Jensen et al. 
(Biochemistry. 1997 Apr 22;36(16):5072-7). Rose uses capillary gel electrophoresis to 
determine binding of PNAs to their complementary oligonucleotide, measuring the 
relative binding kinetics and stoichiometry. Similar types of measurements were made 

1 5 by Jensen et al. using BIAcore™ technology. 

Other applications of PNAs that have been described and will be 
apparent to the skilled artisan include use in DNA strand invasion, antisense inhibition, 
mutational analysis, enhancers of transcription, nucleic acid purification, isolation of 
transcriptionally active genes, blocking of transcription factor binding, genome 

20 cleavage, biosensors, in situ hybridization, and the like. 

Polynucleotide Identification, Characterization and Expression 

Polynucleotides compositions of the present invention may be identified, 
prepared and/or manipulated using any of a variety of well established techniques (see 
generally, Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring 

25 Harbor Laboratories, Cold Spring Harbor, NY, 1989, and other like references). For 
example, a polynucleotide may be identified, as described in more detail below, by 
screening a microarray of cDNAs for tumor-associated expression {i.e., expression that 
is at least two fold greater in a tumor than in normal tissue, as determined using a 
representative assay provided herein). Such screens may be performed, for example, 

30 using the microarray technology of Affymetrix, Inc. (Santa Clara, CA) according to the 
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manufacturer's instructions (and essentially as described by Schena et al., Proc. Natl. 
Acad. Sci. USA P5:10614-10619, 1996 and Heller et al., Proc. Natl. Acad. Set USA 
£4:2150-2155, 1997). Alternatively, polynucleotides may be amplified from cDNA 
prepared from cells expressing the proteins described herein, such as tumor cells. 
5 Many template dependent processes are available to amplify a target 

sequences of interest present in a sample. One of the best known amplification methods 
is the polymerase chain reaction (PCR™) which is described in detail in U.S. Patent 
Nos. 4,683,195, 4,683,202 and 4,800,159, each of which is incorporated herein by 
reference in its entirety. Briefly, in PCR™, two primer sequences are prepared which 

10 are complementary to regions on opposite complementary strands of the target 
sequence. An excess of deoxynucleoside triphosphates is added to a reaction mixture 
along with a DNA polymerase {e.g., Taq polymerase). If the target sequence is present 
in a sample, the primers will bind to the target and the polymerase will cause the 
primers to be extended along the target sequence by adding on nucleotides. By raising 

15 and lowering the temperature of the reaction mixture, the extended primers will 
dissociate from the target to form reaction products, excess primers will bind to the 
target and to the reaction product and the process is repeated. Preferably reverse 
transcription and PCR™ amplification procedure may be performed in order to quantify 
the amount of mRNA amplified. Polymerase chain reaction methodologies are well 

20 known in the art. 

Any of a number of other template dependent processes, many of which 
are variations of the PCR ™ amplification technique, are readily known and available in 
the art. Illustratively, some such methods include the ligase chain reaction (referred to 
as LCR), described, for example, in Eur. Pat. Appl. Publ. No. 320,308 and U.S. Patent 

25 No. 4,883,750; Qbeta Replicase, described in PCT Intl. Pat. Appl. Publ. No. 
PCT/US87/00880; Strand Displacement Amplification (SDA) and Repair Chain 
Reaction (RCR). Still other amplification methods are described in Great Britain Pat. 
Appl. No. 2 202 328, and in PCT Intl. Pat. Appl. Publ. No. PCT/US89/01025. Other 
nucleic acid amplification procedures include transcription-based amplification systems 

30 (TAS) (PCT Intl. Pat. Appl. Publ. No. WO 88/10315), including nucleic acid sequence 
based amplification (NASBA) and 3 SR. Eur. Pat. Appl. Publ. No. 329,822 describes a 
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nucleic acid amplification process involving cyclically synthesizing single-stranded 
RNA ("ssRNA"), ssDNA, and double-stranded DNA (dsDNA). PCT Intl. Pat. Appl. 
Publ. No. WO 89/06700 describes a nucleic acid sequence amplification scheme based 
on the hybridization of a promoter/primer sequence to a target single-stranded DNA 
5 ("ssDNA") followed by transcription of many RNA copies of the sequence. Other 
amplification methods such as "RACE" (Frohman, 1990), and "one-sided PCR" (Ohara, 
1989) are also well-known to those of skill in the art. 

An amplified portion of a polynucleotide of the present invention may be 
used to isolate a full length gene from a suitable library (e.g., a tumor cDNA library) 

10 using well known techniques. Within such techniques, a library (cDNA or genomic) is 
screened using one or more polynucleotide probes or primers suitable for amplification. 
Preferably, a library is size-selected to include larger molecules. Random primed 
libraries may also be preferred for identifying 5' and upstream regions of genes. 
Genomic libraries are preferred for obtaining introns and extending 5' sequences. 

15 For hybridization techniques, a partial sequence may be labeled (e.g., by 

nick-translation or end-labeling with 32 P) using well known techniques. A bacterial or 
bacteriophage library is then generally screened by hybridizing filters containing 
denatured bacterial colonies (or lawns containing phage plaques) with the labeled probe 
(see Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 

20 Laboratories, Cold Spring Harbor, NY, 1989). Hybridizing colonies or plaques are 
selected and expanded, and the DNA is isolated for further analysis. cDNA clones may 
be analyzed to determine the amount of additional sequence by, for example, PCR using 
a primer from the partial sequence and a primer from the vector. Restriction maps and 
partial sequences may be generated to identify one or more overlapping clones. The 

25 complete sequence may then be determined using standard techniques, which may 
involve generating a series of deletion clones. The resulting overlapping sequences can 
then assembled into a single contiguous sequence. A full length cDNA molecule can be 
generated by ligating suitable fragments, using well known techniques. 

Alternatively, amplification techniques, such as those described above, 

30 can be useful for obtaining a full length coding sequence from a partial cDNA sequence. 
One such amplification technique is inverse PCR (see Triglia et al., Nucl. Acids Res. 
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7(5:8186, 1988), which uses restriction enzymes to generate a fragment in the known 
region of the gene. The fragment is then circularized by intramolecular ligation and 
used as a template for PGR with divergent primers derived from the known region. 
Within an alternative approach, sequences adjacent to a partial sequence may be 
5 retrieved by amplification with a primer to a linker sequence and a primer specific to a 
known region. The amplified sequences are typically subjected to a second round of 
amplification with the same linker primer and a second primer specific to the known 
region. A variation on this procedure, which employs two primers that initiate 
extension in opposite directions from the known sequence, is described in WO 

10 96/38591. Another such technique is known as "rapid amplification of cDNA ends" or 
RACE. This technique involves the use of an internal primer and an external primer, 
which hybridizes to a polyA region or vector sequence, to identify sequences that are 5' 
and 3' of a known sequence. Additional techniques include capture PCR (Lagerstrom et 
al., PCR Methods Applic. 7:11 1-19, 1991) and walking PCR (Parker et al., Nucl. Acids. 

15 Res. 79:3055-60, 1991). Other methods employing amplification may also be employed 
to obtain a full length cDNA sequence. 

In certain instances, it is possible to obtain a full length cDNA sequence 
by analysis of sequences provided in an expressed sequence tag (EST) database, such as 
that available from GenBank. Searches for overlapping ESTs may generally be 

20 performed using well known programs {e.g., NCBI BLAST searches), and such ESTs 
may be used to generate a contiguous full length sequence. Full length DNA sequences 
may also be obtained by analysis of genomic fragments. 

In other embodiments of the invention, polynucleotide sequences or 
fragments thereof which encode polypeptides of the invention, or fusion proteins or 

25 functional equivalents thereof, may be used in recombinant DNA molecules to direct 
expression of a polypeptide in appropriate host cells. Due to the inherent degeneracy of 
the genetic code, other DNA sequences that encode substantially the same or a 
functionally equivalent amino acid sequence may be produced and these sequences may 
be used to clone and express a given polypeptide. 

30 As will be understood by those of skill in the art, it may be advantageous 

in some instances to produce polypeptide-encoding nucleotide sequences possessing 
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non-naturally occurring codons. For example, codons preferred by a particular 
prokaryotic or eukaryotic host can be selected to increase the rate of protein expression 
or to produce a recombinant RNA transcript having desirable properties, such as a half- 
life which is longer than that of a transcript generated from the naturally occurring 
5 sequence. 

Moreover, the polynucleotide sequences of the present invention can be 
engineered using methods generally known in the art in order to alter polypeptide 
encoding sequences for a variety of reasons, including but not limited to, alterations 
which modify the cloning, processing, and/or expression of the gene product. For 

10 example, DNA shuffling by random fragmentation and PCR reassembly of gene 
fragments and synthetic oligonucleotides may be used to engineer the nucleotide 
sequences. In addition, site-directed mutagenesis may be used to insert new restriction 
sites, alter glycosylation patterns, change codon preference, produce splice variants, or 
introduce mutations, and so forth. 

15 In another embodiment of the invention, natural, modified, or 

recombinant nucleic acid sequences may be ligated to a heterologous sequence to 
encode a fusion protein. For example, to screen peptide libraries for inhibitors of 
polypeptide activity, it may be useful to encode a chimeric protein that can be 
recognized by a commercially available antibody. A fusion protein may also be 

20 engineered to contain a cleavage site located between the polypeptide-encoding 
sequence and the heterologous protein sequence, so that the polypeptide may be cleaved 
and purified away from the heterologous moiety. 

Sequences encoding a desired polypeptide may be synthesized, in whole 
or in part, using chemical methods well known in the art (see Caruthers, M. H. et al. 

25 (1980) Nucl. Acids Res. Symp. Ser. 215-223, Horn, T. et al. (1980) Nucl. Acids Res. 
Symp. Ser. 225-232). Alternatively, the protein itself may be produced using chemical 
methods to synthesize the amino acid sequence of a polypeptide, or a portion thereof. 
For example, peptide synthesis can be performed using various solid-phase techniques 
(Roberge, J. Y. et al. (1995) Science 269:202-204) and automated synthesis may be 

30 achieved, for example, using the ABI 431 A Peptide Synthesizer (Perkin Elmer, Palo 
Alto, CA). 
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A newly synthesized peptide may be substantially purified by preparative 
high performance liquid chromatography (e.g., Creighton, T. (1983) Proteins, Structures 
and Molecular Principles, WH Freeman and Co., New York, N.Y.) or other comparable 
techniques available in the art. The composition of the synthetic peptides may be 
5 confirmed by amino acid analysis or sequencing (e.g., the Edman degradation 
procedure). Additionally, the amino acid sequence of a polypeptide, or any part thereof, 
may be altered during direct synthesis and/or combined using chemical methods with 
sequences from other proteins, or any part thereof, to produce a variant polypeptide. 

In order to express a desired polypeptide, the nucleotide sequences 

10 encoding the polypeptide, or functional equivalents, may be inserted into appropriate 
expression vector, i.e., a vector which contains the necessary elements for the 
transcription and translation of the inserted coding sequence. Methods which are well 
known to those skilled in the art may be used to construct expression vectors containing 
sequences encoding a polypeptide of interest and appropriate transcriptional and 

15 translational control elements. These methods include in vitro recombinant DNA 
techniques, synthetic techniques, and in vivo genetic recombination. Such techniques 
are described, for example, in Sambrook, J. et al. (1989) Molecular Cloning, A 
Laboratory Manual, Cold Spring Harbor Press, Plainview, N.Y., and Ausubel, F. M. et 
al. (1989) Current Protocols in Molecular Biology, John Wiley & Sons, New York. 

20 N.Y. 

A variety of expression vector/host systems may be utilized to contain 
and express polynucleotide sequences. These include, but are not limited to, 
microorganisms such as bacteria transformed with recombinant bacteriophage, plasmid, 
or cosmid DNA expression vectors; yeast transformed with yeast expression vectors; 

25 insect cell systems infected with virus expression vectors (e.g., baculovirus); plant cell 
systems transformed with virus expression vectors (e.g., cauliflower mosaic virus, 
CaMV; tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or 
pBR322 plasmids); or animal cell systems. 

The "control elements" or "regulatory sequences" present in an 

30 expression vector are those non-translated regions of the vector—enhancers, promoters, 
5' and 3' untranslated regions-which interact with host cellular proteins to carry out 
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transcription and translation. Such elements may vary in their strength and specificity. 
Depending on the vector system and host utilized, any number of suitable transcription 
and translation elements, including constitutive and inducible promoters, may be used. 
For example, when cloning in bacterial systems, inducible promoters such as the hybrid 
5 lacZ promoter of the PBLUESCRIPT phagemid (Stratagene, La Jolla, Calif.) or 
PSPORT1 plasmid (Gibco BRL, Gaithersburg, MD) and the like may be used. In 
mammalian cell systems, promoters from mammalian genes or from mammalian viruses 
are generally preferred. If it is necessary to generate a cell line that contains multiple 
copies of the sequence encoding a polypeptide, vectors based on SV40 or EBV may be 

1 0 advantageously used with an appropriate selectable marker. 

In bacterial systems, any of a number of expression vectors may be 
selected depending upon the use intended for the expressed polypeptide. For example, 
when large quantities are needed, for example for the induction of antibodies, vectors 
which direct high level expression of fusion proteins that are readily purified may be 

15 used. Such vectors include, but are not limited to, the multifunctional E. coli cloning 
and expression vectors such as BLUESCRIPT (Stratagene), in which the sequence 
encoding the polypeptide of interest may be ligated into the vector in frame with 
sequences for the amino-terminal Met and the subsequent 7 residues of .beta.- 
galactosidase so that a hybrid protein is produced; pIN vectors (Van Heeke, G. and S. 

20 M. Schuster (1989) J. Biol. Chem. 264:5503-5509); and the like. pGEX Vectors ' 
(Promega, Madison, Wis.) may also be used to express foreign polypeptides as fusion 
proteins with glutathione S-transferase (GST). In general, such fusion proteins are 
soluble and can easily be purified from lysed cells by adsorption to glutathione-agarose 
beads followed by elution in the presence of free glutathione. Proteins made in such 

25 systems may be designed to include heparin, thrombin, or factor XA protease cleavage 
sites so that the cloned polypeptide of interest can be released from the GST moiety at 
will. 

In the yeast, Saccharomyces cerevisiae, a number of vectors containing 
constitutive or inducible promoters such as alpha factor, alcohol oxidase, and PGH may 
30 be used. For reviews, see Ausubel et al. (supra) and Grant et al. (1987) Methods 
Enzymol. 153:516-544. 
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In cases where plant expression vectors are used, the expression of 
sequences encoding polypeptides may be driven by any of a number of promoters. For 
example, viral promoters such as the 35S and 19S promoters of CaMV may be used 
alone or in combination with the omega leader sequence from TMV (Takamatsu, N. 
5 (1987) EMBO J. 5:307-31 1. Alternatively, plant promoters such as the small subunit of 
RUBISCO or heat shock promoters may be used (Coruzzi, G. et al. (1984) EMBO J. 
3:1671-1680; Broglie, R. et al. (1984) Science 224:838-843; and Winter, J. et al. (1991) 
Results Probl. Cell Differ. 77:85-105). These constructs can be introduced into plant 
cells by direct DNA transformation or pathogen-mediated transfection. Such techniques 

10 are described in a number of generally available reviews (see, for example, Hobbs, S. or 
Murry, L. E. in McGraw Hill Yearbook of Science and Technology (1992) McGraw 
Hill, New York, N.Y.; pp. 191-196). 

An insect system may also be used to express a polypeptide of interest. 
For example, in one such system, Autographa californica nuclear polyhedrosis virus 

15 (AcNPV) is used as a vector to express foreign genes in Spodoptera frugiperda cells or 
in Trichoplusia larvae. The sequences encoding the polypeptide may be cloned into a 
non-essential region of the virus, such as the polyhedrin gene, and placed under control 
of the polyhedrin promoter. Successful insertion of the polypeptide-encoding sequence 
will render the polyhedrin gene inactive and produce recombinant virus lacking coat 

20 protein. The recombinant viruses may then be used to infect, for example, S. frugiperda 
cells or Trichoplusia larvae in which the polypeptide of interest may be expressed 
(Engelhard, E. K. et al. (1994) Proc. Natl. Acad. Set. 91 :3224-3227). 

In mammalian host cells, a number of viral-based expression systems are 
generally available. For example, in cases where an adenovirus is used as an expression 

25 vector, sequences encoding a polypeptide of interest may be ligated into an adenovirus 
transcription/translation complex consisting of the late promoter and tripartite leader 
sequence. Insertion in a non-essential El or E3 region of the viral genome may be used 
to obtain a viable virus which is capable of expressing the polypeptide in infected host 
cells (Logan, J. and Shenk, T. (1984) Proc. Natl. Acad. Sci. 81:3655-3659). In addition, 

30 transcription enhancers, such as the Rous sarcoma virus (RSV) enhancer, may be used 
to increase expression in mammalian host cells. 
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Specific initiation signals may also be used to achieve more efficient 
translation of sequences encoding a polypeptide of interest. Such signals include the 
ATG initiation codon and adjacent sequences. In cases where sequences encoding the 
polypeptide, its initiation codon, and upstream sequences are inserted into the 
5 appropriate expression vector, no additional transcriptional or translational control 
signals may be needed. However, in cases where only coding sequence, or a portion 
thereof, is inserted, exogenous translational control signals including the ATG initiation 
codon should be provided. Furthermore, the initiation codon should be in the correct 
reading frame to ensure translation of the entire insert. Exogenous translational 

10 elements and initiation codons may be of various origins, both natural and synthetic. 
The efficiency of expression may be enhanced by the inclusion of enhancers which are 
appropriate for the particular cell system which is used, such as those described in the 
literature (Scharf, D. et al. (1994) Results Probl. Cell Differ. 20:125-162). 

In addition, a host cell strain may be chosen for its ability to modulate 

15 the expression of the inserted sequences or to process the expressed protein in the 
desired fashion. Such modifications of the polypeptide include, but are not limited to, 
acetylation, carboxylation. glycosylation, phosphorylation, lipidation, and acylation. 
Post-translational processing which cleaves a "prepro" form of the protein may also be 
used to facilitate correct insertion, folding and/or function. Different host cells such as 

20 CHO, COS, HeLa, MDCK, HEK293, and WI3 8, which have specific cellular machinery 
and characteristic mechanisms for such post-translational activities, may be chosen to 
ensure the correct modification and processing of the foreign protein. 

For long-term, high-yield production of recombinant proteins, stable 
expression is generally preferred. For example, cell lines which stably express a 

25 polynucleotide of interest may be transformed using expression vectors which may 
contain viral origins of replication and/or endogenous expression elements and a 
selectable marker gene on the same or on a separate vector. Following the introduction 
of the vector, cells may be allowed to grow for 1-2 days in an enriched media before 
they are switched to selective media. The purpose of the selectable marker is to confer 

30 resistance to selection, and its presence allows growth and recovery of cells which 



63 



WO 02/47534 



PCT7US01/47576 



successfully express the introduced sequences. Resistant clones of stably transformed 
cells may be proliferated using tissue culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed 
cell lines. These include, but are not limited to, the herpes simplex virus thymidine 
5 kinase (Wigler, M. et al. (1977) Cell 7i:223-32) and adenine phosphoribosyltransferase 
(Lowy, I. et al. (1990) Cell 22:817-23) genes which can be employed in tk.sup.- or 
aprt.sup.- cells, respectively. Also, antimetabolite, antibiotic or herbicide resistance can 
be used as the basis for selection; for example, dhfr which confers resistance to 
methotrexate (Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. 77:3567-70); npt, which 

10 confers resistance to the aminoglycosides, neomycin and G-418 (Colbere-Garapin, F. et 
al (1981) J. Mol. Biol. 750:1-14); and als or pat, which confer resistance to 
chlorsulfuron and phosphinotricin acetyltransferase, respectively (Murry, supra). 
Additional selectable genes have been described, for example, trpB, which allows cells 
to utilize indole in place of tryptophan, or hisD, which allows cells to utilize histinol in 

15 place of histidine (Hartman, S. C. and R. C. Mulligan (1988) Proc. Natl. Acad. Sci. 
55:8047-51). The use of visible markers has gained popularity with such markers as 
anthocyanins, beta-glucuronidase and its substrate GUS, and luciferase and its substrate 
luciferin, being widely used not only to identify transformants, but also to quantify the 
amount of transient or stable protein expression attributable to a specific vector system 

20 (Rhodes, C. A. et al. (1995) Methods Mol. Biol. 55:121-131). 

Although the presence/absence of marker gene expression suggests that 
the gene of interest is also present, its presence and expression may need to be 
confirmed. For example, if the sequence encoding a polypeptide is inserted within a 
marker gene sequence, recombinant cells containing sequences can be identified by the 

25 absence of marker gene function. Alternatively, a marker gene can be placed in tandem 
with a polypeptide-encoding sequence under the control of a single promoter. 
Expression of the marker gene in response to induction or selection usually indicates 
expression of the tandem gene as well. 

Alternatively, host cells that contain and express a desired 

30 polynucleotide sequence may be identified by a variety of procedures known to those of 
skill in the art. These procedures include, but are not limited to, DNA-DNA or DNA- 
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RNA hybridizations and protein bioassay or immunoassay techniques which include, 
for example, membrane, solution, or chip based technologies for the detection and/or 
quantification of nucleic acid or protein. 

A variety of protocols for detecting and measuring the expression of 
5 polynucleotide-encoded products, using either polyclonal or monoclonal antibodies 
specific for the product are known in the art. Examples include enzyme-linked 
immunosorbent assay (ELISA), radioimmunoassay (RIA), and fluorescence activated 
cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing monoclonal 
antibodies reactive to two non-interfering epitopes on a given polypeptide may be 

1 0 preferred for some applications, but a competitive binding assay may also be employed. 
These and other assays are described, among other places, in Hampton, R. et al. (1990; 
Serological Methods, a Laboratory Manual, APS Press, St Paul. Minn.) and Maddox, D. 
E. et al. (1983; J. Exp. Med. 158:121 1-1216). 

A wide variety of labels and conjugation techniques are known by those 

1 5 skilled in the art and may be used in various nucleic acid and amino acid assays. Means 
for producing labeled hybridization or PCR probes for detecting sequences related to 
polynucleotides include oligolabeling, nick translation, end-labeling or PCR 
amplification using a labeled nucleotide. Alternatively, the sequences, or any portions 
thereof may be cloned into a vector for the production of an mRNA probe. Such vectors 

20 are known in the art, are commercially available, and may be used to synthesize RNA 
probes in vitro by addition of an appropriate RNA polymerase such as T7, T3, or SP6 
and labeled nucleotides. These procedures may be conducted using a variety of 
commercially available kits. Suitable reporter molecules or labels, which may be used 
include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents 

25 as well as substrates, cofactors, inhibitors, magnetic particles, and the like. 

Host cells transformed with a polynucleotide sequence of interest may be 
cultured under conditions suitable for the expression and recovery of the protein from 
cell culture. The protein produced by a recombinant cell may be secreted or contained 
intracellularly depending on the sequence and/or the vector used. As will be understood 

30 by those of skill in the art, expression vectors containing polynucleotides of the 
invention may be designed to contain signal sequences which direct secretion of the 
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encoded polypeptide through a prokaryotic or eukaryotic cell membrane. Other 
recombinant constructions may be used to join sequences encoding a polypeptide of 
interest to nucleotide sequence encoding a polypeptide domain which will facilitate 
purification of soluble proteins. Such purification facilitating domains include, but are 
5 not limited to, metal chelating peptides such as histidine-tryptophan modules that allow 
purification on immobilized metals, protein A domains that allow purification on 
immobilized immunoglobulin, and the domain utilized in the FLAGS extension/affinity 
purification system (Immunex Corp., Seattle, Wash.). The inclusion of cleavable linker 
sequences such as those specific for Factor XA or enterokinase (Invitrogen. San Diego, 

10 Calif.) between the purification domain and the encoded polypeptide may be used to 
facilitate purification. One such expression vector provides for expression of a fusion 
protein containing a polypeptide of interest and a nucleic acid encoding 6 histidine 
residues preceding a thioredoxin or an enterokinase cleavage site. The histidine residues 
facilitate purification on IMIAC (immobilized metal ion affinity chromatography) as 

15 described in Porath, J. et al. (1992, Prot. Exp. Purif. 5:263-281) while the enterokinase 
cleavage site provides a means for purifying the desired polypeptide from the fusion 
protein. A discussion of vectors which contain fusion proteins is provided in Kroll, D. J. 
et al. (1993; DNA Cell Biol. 72:441-453). 

In addition to recombinant production methods, polypeptides of the 

20 invention, and fragments thereof, may be produced by direct peptide synthesis using 
solid-phase techniques (Merrifield J. (1963) J. Am. Chem. Soc. 85:2149-2154). Protein 
synthesis may be performed using manual techniques or by automation. Automated 
synthesis may be achieved, for example, using Applied Biosystems 431 A Peptide 
Synthesizer (Perkin Elmer). Alternatively, various fragments may be chemically 

25 synthesized separately and combined using chemical methods to produce the full length 
molecule. 

Antibody Compositions, Fragments Thereof and Other Binding Agents 

According to another aspect, the present invention further provides 
binding agents, such as antibodies and antigen-binding fragments thereof, that exhibit 
30 immunological binding to a tumor polypeptide disclosed herein, or to a portion, variant 



66 



WO 02/47534 



PCT7US01/47576 



or derivative thereof. An antibody, or antigen-binding fragment thereof, is said to 
"specifically bind," "immunogically bind," and/or is "immunologically reactive" to a 
polypeptide of the invention if it reacts at a detectable level (within, for example, an 
ELISA assay) with the polypeptide, and does not react detectably with unrelated 
5 polypeptides under similar conditions. 

Immunological binding, as used in this context, generally refers to the 
non-covalent interactions of the type which occur between an immunoglobulin 
molecule and an antigen for which the immunoglobulin is specific. The strength, or 
affinity of immunological binding interactions can be expressed in terms of the 

10 dissociation constant (Kd) of the interaction, wherein a smaller K<j represents a greater 
affinity. Immunological binding properties of selected polypeptides can be quantified 
using methods well known in the ait. One such method entails measuring the rates of 
antigen-binding site/antigen complex formation and dissociation, wherein those rates 
depend on the concentrations of the complex partners, the affinity of the interaction, and 

15 on geometric parameters that equally influence the rate in both directions. Thus, both 
the "on rate constant" (Ko n ) and the "off rate constant" (K 0 ff) can be determined by 
calculation of the concentrations and the actual rates of association and dissociation. 
The ratio of K 0 ff /K on enables cancellation of all parameters not related to affinity, and is 
thus equal to the dissociation constant Kd. See, generally, Davies et al. (1990) Annual 

20 Rev. Biochem. 59:439-473. 

An "antigen-binding site," or "binding portion" of an antibody refers to 
the part of the immunoglobulin molecule that participates in antigen binding. The 
antigen binding site is formed by amino acid residues of the N-terminal variable ("V") 
regions of the heavy ("H") and light ("L") chains. Three highly divergent stretches 

25 within the V regions of the heavy and light chains are referred to as "hypervariable 
regions" which are interposed between more conserved flanking stretches known as 
"framework regions," or "FRs". Thus the term "FR" refers to amino acid sequences 
which are naturally found between and adjacent to hypervariable regions in 
immunoglobulins. In an antibody molecule, the three hypervariable regions of a light 

30 chain and the three hypervariable regions of a heavy chain are disposed relative to each 
other in three dimensional space to form an antigen-binding surface. The antigen- 
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binding surface is complementary to the three-dimensional surface of a bound antigen, 
and the three hypervariable regions of each of the heavy and light chains are referred to 
as "complementarity-determining regions," or "CDRs." 

Binding agents may be further capable of differentiating between patients 
5 with and without a cancer, such as lung cancer, using the representative assays provided 
herein. For example, antibodies or other binding agents that bind to a tumor protein 
will preferably generate a signal indicating the presence of a cancer in at least about 
20% of patients with the disease, more preferably at least about 30% of patients. 
Alternatively, or in addition, the antibody will generate a negative signal indicating the 

10 absence of the disease in at least about 90% of individuals without the cancer. To 
determine whether a binding agent satisfies this requirement, biological samples (e.g., 
blood, sera, sputum, urine and/or tumor biopsies) from patients with and without a 
cancer (as determined using standard clinical tests) may be assayed as described herein 
for the presence of polypeptides that bind to the binding agent. Preferably, a statistically 

15 significant number of samples with and without the disease will be assayed. Each 
binding agent should satisfy the above criteria; however, those of ordinary skill in the 
art will recognize that binding agents may be used in combination to improve 
sensitivity. 

Any agent that satisfies the above requirements may be a binding agent. 

20 For example, a binding agent may be a ribosome, with or without a peptide component, 
an RNA molecule or a polypeptide. In a preferred embodiment, a binding agent is an 
antibody or an antigen-binding fragment thereof. Antibodies may be prepared by any of 
a variety of techniques known to those of ordinary skill in the art. See, e.g., Harlow and 
Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988. In 

25 general, antibodies can be produced by cell culture techniques, including the generation 
of monoclonal antibodies as described herein, or via transfection of antibody genes into 
suitable bacterial or mammalian cell hosts, in order to allow for the production of 
recombinant antibodies. In one technique, an immunogen comprising the polypeptide is 
initially injected into any of a wide variety of mammals (e.g., mice, rats, rabbits, sheep 

30 or goats). In this step, the polypeptides of this invention may serve as the immunogen 
without modification. Alternatively, particularly for relatively short polypeptides, a 
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superior immune response may be elicited if the polypeptide is joined to a carrier 
protein, such as bovine serum albumin or keyhole limpet hemocyanin. The immunogen 
is injected into the animal host, preferably according to a predetermined schedule 
incorporating one or more booster immunizations, and the animals are bled periodically. 
5 Polyclonal antibodies specific for the polypeptide may then be purified from such 
antisera by, for example, affinity chromatography using the polypeptide coupled to a 
suitable solid support. 

Monoclonal antibodies specific for an antigenic polypeptide of interest 
may be prepared, for example, using the technique of Kohler and Milstein, Eur. J. 

10 Immunol. 5:511-519, 1976, and improvements thereto. Briefly, these methods involve 
the preparation of immortal cell lines capable of producing antibodies having the 
desired specificity (i.e., reactivity with the polypeptide of interest). Such cell lines may 
be produced, for example, from spleen cells obtained from an animal immunized as 
described above. The spleen cells are then immortalized by, for example, fusion with a 

15 myeloma cell fusion partner, preferably one that is syngeneic with the immunized 
animal. A variety of fusion techniques may be employed. For example, the spleen cells 
and myeloma cells may be combined with a nonionic detergent for a few minutes and 
then plated at low density on a selective medium that supports the growth of hybrid 
cells, but not myeloma cells. A preferred selection technique uses HAT (hypoxanthine, 

20 aminopterin, thymidine) selection. After a sufficient time, usually about 1 to 2 weeks, 
colonies of hybrids are observed. Single colonies are selected and their culture 
supernatants tested for binding activity against the polypeptide. Hybridomas having 
high reactivity and specificity are preferred. 

Monoclonal antibodies may be isolated from the supernatants of growing 

25 hybridoma colonies. In addition, various techniques may be employed to enhance the 
yield, such as injection of the hybridoma cell line into the peritoneal cavity of a suitable 
vertebrate host, such as a mouse. Monoclonal antibodies may then be harvested from 
the ascites fluid or the blood. Contaminants may be removed from the antibodies by 
conventional techniques, such as chromatography, gel filtration, precipitation, and 

30 extraction. The polypeptides of this invention may be used in the purification process 
in, for example, an affinity chromatography step. 
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A number of therapeutically useful molecules are known in the art which 
comprise antigen-binding sites that are capable of exhibiting immunological binding 
properties of an antibody molecule. The proteolytic enzyme papain preferentially 
cleaves IgG molecules to yield several fragments, two of which (the "F(ab) M fragments) 
5 each comprise a covalent heterodimer that includes an intact antigen-binding site. The 
enzyme pepsin is able to cleave IgG molecules to provide several fragments, including 
the "F(ab')2 " fragment which comprises both antigen-binding sites. An "Fv" fragment 
can be produced by preferential proteolytic cleavage of an IgM, and on rare occasions 
IgG or IgA immunoglobulin molecule. Fv fragments are, however, more commonly 

10 derived using recombinant techniques known in the art. The Fv fragment includes a 
non-covalent Vh::Vl heterodimer including an antigen-binding site which retains much 
of the antigen recognition and binding capabilities of the native antibody molecule. 
Inbar et al. (1972) Proc. Nat. Acad. Sci. USA 69:2659-2662; Hochman et al. (1976) 
Biochem 15:2706-2710; and Ehrlich et al. (1980) Biochem 19:4091-4096. 

15 A single chain Fv ("sFv") polypeptide is a covalently linked Vh::Vl 

heterodimer which is expressed from a gene fusion including Vh- and VL-encoding 
genes linked by a peptide-encoding linker. Huston et al. (1988) Proc. Nat. Acad. Sci. 
USA 85(16):5879-5883. A number of methods have been described to discern chemical 
structures for converting the naturally aggregated~but chemically separated-light and 

20 heavy polypeptide chains from an antibody V region into an sFv molecule which will 
fold into a three dimensional structure substantially similar to the structure of an 
antigen-binding site. See, e.g., U.S. Pat. Nos. 5,091,513 and 5,132,405, to Huston et al.; 
and U.S. Pat. No. 4,946,778, to Ladner et al. 

Each of the above-described molecules includes a heavy chain and a 

25 light chain CDR set, respectively interposed between a heavy chain and a light chain FR 
set which provide support to the CDRS and define the spatial relationship of the CDRs 
relative to each other. As used herein, the term "CDR set" refers to the three 
hypervariable regions of a heavy or light chain V region. Proceeding from the N- 
terminus of a heavy or light chain, these regions are denoted as "CDR1," "CDR2," and 

30 "CDR3" respectively. An antigen-binding site, therefore, includes six CDRs, 
comprising the CDR set from each of a heavy and a light chain V region. A polypeptide 
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comprising a single CDR, (e.g., a CDR1, CDR2 or CDR3) is referred to herein as a 
"molecular recognition unit." Crystallographic analysis of a number of antigen-antibody 
complexes has demonstrated that the amino acid residues of CDRs form extensive 
contact with bound antigen, wherein the most extensive antigen contact is with the 
5 heavy chain CDR3. Thus, the molecular recognition units are primarily responsible for 
the specificity of an antigen-binding site. 

As used herein, the term "FR set" refers to the four flanking amino acid 
sequences which frame the CDRs of a CDR set of a heavy or light chain V region. 
Some FR residues may contact bound antigen; however, FRs are primarily responsible 

10 for folding the V region into the antigen-binding site, particularly the FR residues 
directly adjacent to the CDRS. Within FRs, certain amino residues and certain structural 
features are very highly conserved. In this regard, all V region sequences contain an 
internal disulfide loop of around 90 amino acid residues. When the V regions fold into a. 
binding-site, the CDRs are displayed as projecting loop motifs which form an antigen- 

15 binding surface. It is generally recognized that there are conserved structural regions of 
FRs which influence the folded shape of the CDR loops into certain "canonical" 
structures-regardless of the precise CDR amino acid sequence. Further, certain FR 
residues are known to participate in non-covalent interdomain contacts which stabilize 
the interaction of the antibody heavy and light chains. 

20 A number of "humanized" antibody molecules comprising an antigen- 

binding site derived from a non-human immunoglobulin have been described, including 
chimeric antibodies having rodent V regions and their associated CDRs fused to human 
constant domains (Winter et al. (1991) Nature 349:293-299; Lobuglio et al. (1989) 
Proc. Nat. Acad. Sci. USA 86:4220-4224; Shaw et al. (1987) J Immunol. 138:4534- 

25 4538; and Brown et al. (1987) Cancer Res. 47:3577-3583), rodent CDRs grafted into a 
human supporting FR prior to fusion with an appropriate human antibody constant 
domain (Riechmann et al. (1988) Nature 332:323-327; Verhoeyen et al. (1988) Science 
239:1534-1536; and Jones et al. (1986) Nature 321:522-525), and rodent CDRs 
supported by recombinantly veneered rodent FRs (European Patent Publication No. 

30 519,596, published Dec. 23, 1992). These "humanized" molecules are designed to 
minimize unwanted immunological response toward rodent antihuman antibody 
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molecules which limits the duration and effectiveness of therapeutic applications of 
those moieties in human recipients. 

As used herein, the terms "veneered FRs" and "recombinantly veneered 
FRs" refer to the selective replacement of FR residues from, e.g., a rodent heavy or light 
5 chain V region, with human FR residues in order to provide a xenogeneic molecule 
comprising an antigen-binding site which retains substantially all of the native FR 
polypeptide folding structure. Veneering techniques are based on the understanding that 
the ligand binding characteristics of an antigen-binding site are determined primarily by 
the structure and relative disposition of the heavy and light chain CDR sets within the 

10 antigen-binding surface. Davies et al. (1990) Ann. Rev. Biochem. 59:439-473. Thus, 
antigen binding specificity can be preserved in a humanized antibody only wherein the 
CDR structures, their interaction with each other, and their interaction with the rest of 
the V region domains are carefully maintained. By using veneering techniques, exterior 
(e.g., solvent-accessible) FR residues which are readily encountered by the immune 

15 system are selectively replaced with human residues to provide a hybrid molecule that 
comprises either a weakly immunogenic, or substantially non-immunogenic veneered 
surface. 

The process of veneering makes use of the available sequence data for 
human antibody variable domains compiled by Kabat et al., in Sequences of Proteins of 

20 Immunological Interest, 4th ed., (U.S. Dept. of Health and Human Services, U.S. 
Government Printing Office, 1987), updates to the Kabat database, and other accessible 
U.S. and foreign databases (both nucleic acid and protein). Solvent accessibilities of V 
region amino acids can be deduced from the known three-dimensional structure for 
human and murine antibody fragments. There are two general steps in veneering a 

25 murine antigen-binding site. Initially, the FRs of the variable domains of an antibody 
molecule of interest are compared with corresponding FR sequences of human variable 
domains obtained from the above-identified sources. The most homologous human V 
regions are then compared residue by residue to corresponding murine amino acids. The 
residues in the murine FR which differ from the human counterpart are replaced by the 

30 residues present in the human moiety using recombinant techniques well known in the 
art. Residue switching is only carried out with moieties which are at least partially 
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exposed (solvent accessible), and care is exercised in the replacement of amino acid 
residues which may have a significant effect on the tertiary structure of V region 
domains, such as proline, glycine and charged amino acids. 

In this manner, the resultant "veneered" murine antigen-binding sites are 
5 thus designed to retain the murine CDR residues, the residues substantially adjacent to 
the CDRs, the residues identified as buried or mostly buried (solvent inaccessible), the 
residues believed to participate in non-covalent (e.g., electrostatic and hydrophobic) 
contacts between heavy and light chain domains, and the residues from conserved 
structural regions of the FRs which are believed to influence the "canonical" tertiary 

1 0 structures of the CDR loops. These design criteria are then used to prepare recombinant 
nucleotide sequences which combine the CDRs of both the heavy and light chain of a 
murine antigen-binding site into human-appearing FRs that can be used to transfect 
mammalian cells for the expression of recombinant human antibodies which exhibit the 
antigen specificity of the murine antibody molecule. 

15 In another embodiment of the invention, monoclonal antibodies of the 

present invention may be coupled to one or more therapeutic agents. Suitable agents in 
this regard include radionuclides, differentiation inducers, drugs, toxins, and derivatives 
thereof. Preferred radionuclides include 90 Y, 123 I, I25 I, 131 I, 186 Re, 188 Re, 211 At, and 
212 Bi. Preferred drugs include methotrexate, and pyrimidine and purine analogs. 

20 Preferred differentiation inducers include phorbol esters and butyric acid. Preferred 
toxins include ricin, abrin, diptheria toxin, cholera toxin, gelonin, Pseudomonas 
exotoxin, Shigella toxin, and pokeweed antiviral protein. 

A therapeutic agent may be coupled (e.g., covalently bonded) to a 
suitable monoclonal antibody either directly or indirectly (e.g., via a linker group). A 

25 direct reaction between an agent and an antibody is possible when each possesses a 
substituent capable of reacting with the other. For example, a nucleophilic group, such 
as an amino or sulfhydryl group, on one may be capable of reacting with a carbonyl- 
containing group, such as an anhydride or an acid halide, or with an alkyl group 
containing a good leaving group (e.g., a halide) on the other. 

30 Alternatively, it may be desirable to couple a therapeutic agent and an 

antibody via a linker group. A linker group can function as a spacer to distance an 
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antibody from an agent in order to avoid interference with binding capabilities. A linker 
group can also serve to increase the chemical reactivity of a substituent on an agent or 
an antibody, and thus increase the coupling efficiency. An increase in chemical 
reactivity may also facilitate the use of agents, or functional groups on agents, which 
5 otherwise would not be possible. 

It will be evident to those skilled in the art that a variety of bifunctional 
or polyfunctional reagents, both homo- and hetero-functional (such as those described in 
the catalog of the Pierce Chemical Co., Rockford, IL), may be employed as the linker 
group. Coupling may be effected, for example, through amino groups, carboxyl groups, 

10 sulfhydryl groups or oxidized carbohydrate residues. There are numerous references 
describing such methodology, e.g., U.S. Patent No. 4,671,958, to Rodwell et al. 

Where a therapeutic agent is more potent when free from the antibody 
portion of the immunoconjugates of the present invention, it may be desirable to use a 
linker group which is cleavable during or upon internalization into a cell. A number of 

15 different cleavable linker groups have been described. The mechanisms for the 
intracellular release of an agent from these linker groups include cleavage by reduction 
of a disulfide bond (e.g., U.S. Patent No. 4,489,710, to Spitler), by irradiation of a 
photolabile bond (e.g., U.S. Patent No. 4,625,014, to Senter etal.), by hydrolysis of 
derivatized amino acid side chains (e.g., U.S. Patent No. 4,638,045, to Kohn et al), by 

20 serum complement-mediated hydrolysis (e.g., U.S. Patent No. 4,671,958, to Rodwell 
et al.), and acid-catalyzed hydrolysis (e.g., U.S. Patent No. 4,569,789, to Blattler et al.). 

It may be desirable to couple more than one agent to an antibody. In one 
embodiment, multiple molecules of an agent are coupled to one antibody molecule. In 
another embodiment, more than one type of agent may be coupled to one antibody. 

25 Regardless of the particular embodiment, immunoconjugates with more than one agent 
may be prepared in a variety of ways. For example, more than one agent may be 
coupled directly to an antibody molecule, or linkers that provide multiple sites for 
attachment can be used. Alternatively, a carrier can be used. 

A carrier may bear the agents in a variety of ways, including covalent 

30 bonding either directly or via a linker group. Suitable carriers include proteins such as 
albumins (e.g., U.S. Patent No. 4,507,234, to Kato et al), peptides and polysaccharides 
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such as aminodextran (e.g., U.S. Patent No. 4,699,784, to Shih et al). A carrier may 
also bear an agent by noncovalent bonding or by encapsulation, such as within a 
liposome vesicle (e.g., U.S. Patent Nos. 4,429,008 and 4,873,088). Carriers specific for 
radionuclide agents include radiohalogenated small molecules and chelating 
5 compounds. For example, U.S. Patent No. 4,735,792 discloses representative 
radiohalogenated small molecules and their synthesis. A radionuclide chelate may be 
formed from chelating compounds that include those containing nitrogen and sulfur 
atoms as the donor atoms for binding the metal, or metal oxide, radionuclide. For 
example, U.S. Patent No. 4,673,562, to Davison et al. discloses representative chelating 
1 0 compounds and their synthesis. 



T Cell Compositions 

The present invention, in another aspect, provides T cells specific for a 
tumor polypeptide disclosed herein, or for a variant or derivative thereof. Such cells 
may generally be prepared in vitro or ex vivo, using standard procedures. For example, 

15 T cells may be isolated from bone marrow, peripheral blood, or a fraction of bone 
marrow or peripheral blood of a patient, using a commercially available cell separation 
system, such as the Isolex™ System, available from Nexell Therapeutics, Inc. (Irvine, 
CA; see also U.S. Patent No. 5,240,856; U.S. Patent No. 5,215,926; WO 89/06280; WO 
91/16116 and WO 92/07243). Alternatively, T cells may be derived from related or 

20 unrelated humans, non-human mammals, cell lines or cultures. 

T cells may be stimulated with a polypeptide, polynucleotide encoding a 
polypeptide and/or an antigen presenting cell (APC) that expresses such a polypeptide. 
Such stimulation is performed under conditions and for a time sufficient to permit the 
generation of T cells that are specific for the polypeptide of interest. Preferably, a tumor 

25 polypeptide or polynucleotide of the invention is present within a delivery vehicle, such 
as a microsphere, to facilitate the generation of specific T cells. 

T cells are considered to be specific for a polypeptide of the present 
invention if the T cells specifically proliferate, secrete cytokines or kill target cells 
coated with the polypeptide or expressing a gene encoding the polypeptide. T cell 

30 specificity may be evaluated using any of a variety of standard techniques. For 
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example, within a chromium release assay or proliferation assay, a stimulation index of 
more than two fold increase in lysis and/or proliferation, compared to negative controls, 
indicates T cell specificity. Such assays may be performed, for example, as described in 
Chen et al., Cancer Res. 54:1065-1070, 1994. Alternatively, detection of the 
5 proliferation of T cells may be accomplished by a variety of known techniques. For 
example, T cell proliferation can be detected by measuring an increased rate of DNA 
synthesis (e.g., by pulse-labeling cultures of T cells with tritiated thymidine and 
measuring the amount of tritiated thymidine incorporated into DNA). Contact with a 
tumor polypeptide (100 ng/ml - 100 ug/ml, preferably 200 ng/ml - 25 ug/mi) for 3 - 7 

10 days will typically result in at least a two fold increase in proliferation of the T cells. 
Contact as described above for 2-3 hours should result in activation of the T cells, as 
measured using standard cytokine assays in which a two fold increase in the level of 
cytokine release (e.g., TNF or IFN-y) is indicative of T cell activation (see Coligan et 
al., Current Protocols in Immunology, vol. 1, Wiley Interscience (Greene 1998)). T 

15 cells that have been activated in response to a tumor polypeptide, polynucleotide or 
polypeptide-expressing APC may be CD4 + and/or CD8 + . Tumor polypeptide-specific T 
cells may be expanded using standard techniques. Within preferred embodiments, the T 
cells are derived from a patient, a related donor or an unrelated donor, and are 
administered to the patient following stimulation and expansion. 

20 For therapeutic purposes, CD4 + or CD8 + T cells that proliferate in 

response to a tumor polypeptide, polynucleotide or APC can be expanded in number 
either in vitro or in vivo. Proliferation of such T cells in vitro may be accomplished in a 
variety of ways. For example, the T cells can be re-exposed to a tumor polypeptide, or a 
short peptide corresponding to an immunogenic portion of such a polypeptide, with or 

25 without the addition of T cell growth factors, such as interleukin-2, and/or stimulator 
cells that synthesize a tumor polypeptide. Alternatively, one or more T cells that 
proliferate in the presence of the tumor polypeptide can be expanded in number by 
cloning. Methods for cloning cells are well known in the art, and include limiting 
dilution. 
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T Cell Receptor Compositions 

The T cell receptor (TCR) consists of 2 different, highly variable 
polypeptide chains, termed the T-cell receptor a and (3 chains, that are linked by a 
disulfide bond (Janeway, Travers, Walport. Immunobiology. Fourth Ed., 148-159. 
5 Elsevier Science Ltd/Garland Publishing. 1999). The oc/p heterodimer complexes with 
the invariant CD3 chains at the cell membrane. This complex recognizes specific 
antigenic peptides bound to MHC molecules. The enormous diversity of TCR 
specificities is generated much like immunoglobulin diversity, through somatic gene 
rearrangement. The p chain genes contain over 50 variable (V), 2 diversity (D), over 10 

10 joining (J) segments, and 2 constant region segments (C). The a chain genes contain 
over 70 V segments, and over 60 J segments but no D segments, as well as one C 
segment. During T cell development in the thymus, the D to J gene rearrangement of 
the p chain occurs, followed by the V gene segment rearrangement to the DJ. This 
functional VDJp exon is transcribed and spliced to join to a Cp. For the a chain, a V a 

15 gene segment rearranges to a J a gene segment to create the functional exon that is then 
transcribed and spliced to the C a . Diversity is further increased during the 
recombination process by the random addition of P and N-nucleotides between the V, 
D, and J segments of the P chain and between the V and J segments in the a chain 
(Janeway, Travers, Walport. Immunobiology. Fourth Ed., 98 and 150. Elsevier Science 

20 Ltd/Garland Publishing. 1999). 

The present invention, in another aspect, provides TCRs specific for a 
polypeptide disclosed herein, or for a variant or derivative thereof. In accordance with 
the present invention, polynucleotide and amino acid sequences are provided for the V-J 
or V-D-J junctional regions or parts thereof for the alpha and beta chains of the T-cell 

25 receptor which recognize tumor polypeptides described herein. In general, this aspect 
of the invention relates to T-cell receptors which recognize or bind tumor polypeptides 
presented in the context of MHC. In a preferred embodiment the tumor antigens 
recognized by the T-cell receptors comprise a polypeptide of the present invention. For 
example, cDNA encoding a TCR specific for a _tumor peptide can be isolated from T 

30 cells specific for a tumor polypeptide using standard molecular biological and 
recombinant DNA techniques. 
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This invention further includes the T-cell receptors or analogs thereof 
having substantially the same function or activity as the T-cell receptors of this 
invention which recognize or bind tumor polypeptides. Such receptors include, but are 
not limited to, a fragment of the receptor, or a substitution, addition or deletion mutant 
5 of a T-cell receptor provided herein. This invention also encompasses polypeptides or 
peptides that are substantially homologous to the T-cell receptors provided herein or 
that retain substantially the same activity. The term "analog" includes any protein or 
polypeptide having an amino acid residue sequence substantially identical to the T-cell 
receptors provided herein in which one or more residues, preferably no more than 5 

1 0 residues, more preferably no more than 25 residues have been conservatively substituted 
with a functionally similar residue and which displays the functional aspects of the T- 
cell receptor as described herein. 

The present invention further provides for suitable mammalian host 
cells, for example, non-specific T cells, that are transfected with a polynucleotide 

1 5 encoding TCRs specific for a polypeptide described herein, thereby rendering the host 
cell specific for the polypeptide. The a and |3 chains of the TCR may be contained on 
separate expression vectors or alternatively, on a single expression vector that also 
contains an internal ribosome entry site (IRES) for cap-independent translation of the 
gene downstream of the IRES. Said host cells expressing TCRs specific for the 

20 polypeptide may be used, for example, for adoptive immunotherapy of lung cancer as 
discussed further below. 

In further aspects of the present invention, cloned TCRs specific for a 
polypeptide recited herein may be used in a kit for the diagnosis of lung cancer. For 
example, the nucleic acid sequence or portions thereof, of tumor-specific TCRs can be 
25 used as probes or primers for the detection of expression of the rearranged genes 
encoding the specific TCR in a biological sample. Therefore, the present invention 
further provides for an assay for detecting messenger RNA or DNA encoding the TCR 
specific for a polypeptide. 
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Pharmaceutical Compositions 

In additional embodiments, the present invention concerns formulation 
of one or more of the polynucleotide, polypeptide, T-cell and/or antibody compositions 
disclosed herein in pharmaceutically-acceptable carriers for administration to a cell or 
5 an animal, either alone, or in combination with one or more other modalities of therapy. 

It will be understood that, if desired, a composition as disclosed herein 
may be administered in combination with other agents as well, such as, e.g., other 
proteins or polypeptides or various pharmaceutically-active agents. In fact, there is 
virtually no limit to other components that may also be included, given that the 

10 additional agents do not cause a significant adverse effect upon contact with the target 
cells or host tissues. The compositions may thus be delivered along with various other 
agents as required in the particular instance. Such compositions may be purified from 
host cells or other biological sources, or alternatively may be chemically synthesized as 
described herein. Likewise, such compositions may further comprise substituted or 

1 5 derivatized RNA or DNA compositions. 

Therefore, in another aspect of the present invention, pharmaceutical 
compositions are provided comprising one or more of the polynucleotide, polypeptide, 
antibody, and/or T-cell compositions described herein in combination with a 
physiologically acceptable carrier. In certain preferred embodiments, the 

20 pharmaceutical compositions of the invention comprise immunogenic polynucleotide 
and/or polypeptide compositions of the invention for use in prophylactic and theraputic 
vaccine applications. Vaccine preparation is generally described in, for example, M.F. 
Powell and MJ. Newman, eds., "Vaccine Design (the subunit and adjuvant approach)," 
Plenum Press (NY, 1995). Generally, such compositions will comprise one or more 

25 polynucleotide and/or polypeptide compositions of the present invention in combination 
with one or more immunostimulants. 

It will be apparent that any of the pharmaceutical compositions described 
herein can contain pharmaceutically acceptable salts of the polynucleotides and 
polypeptides of the invention. Such salts can be prepared, for example, from 

30 pharmaceutically acceptable non-toxic bases, including organic bases {e.g., salts of 
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primary, secondary and tertiary amines and basic amino acids) and inorganic bases (e.g., 
sodium, potassium, lithium, ammonium, calcium and magnesium salts). 

In another embodiment, illustrative immunogenic compositions, e.g., 
vaccine compositions, of the present invention comprise DNA encoding one or more of 
5 the polypeptides as described above, such that the polypeptide is generated in situ. As 
noted above, the polynucleotide may be administered within any of a variety of delivery 
systems known to those of ordinary skill in the art. Indeed, numerous gene delivery 
techniques are well known in the art, such as those described by Rolland, Crit. Rev. 
Therap. Drug Carrier Systems 75:143-198, 1998, and references cited therein. 

10 Appropriate polynucleotide expression systems will, of course, contain the necessary 
regulatory DNA regulatory sequences for expression in a patient (such as a suitable 
promoter and terminating signal). Alternatively, bacterial delivery systems may involve 
the administration of a bacterium (such as Bacillus-Calmette-Guerriri) that expresses an 
immunogenic portion of the polypeptide on its cell surface or secretes such an epitope. 

15 Therefore, in certain embodiments, polynucleotides encoding 

immunogenic polypeptides described herein are introduced into suitable mammalian 
host cells for expression using any of a number of known viral-based systems. In one 
illustrative embodiment, retroviruses provide a convenient and effective platform for 
gene delivery systems. A selected nucleotide sequence encoding a polypeptide of the 

20 present invention can be inserted into a vector and packaged in retroviral particles using 
techniques known in the art. The recombinant virus can then be isolated and delivered 
to a subject. A number of illustrative retroviral systems have been described (e.g., U.S. 
Pat. No. 5,219,740; Miller and Rosman (1989) BioTechniques 7:980-990; Miller, A. D. 
(1990) Human Gene Therapy 1:5-14; Scarpa et al. (1991) Virology 180:849-852; Burns 

25 et al. (1993) Proc. Natl. Acad. Sci. USA 90:8033-8037; and Boris-Lawrie and Temin 
(1993) Cur. Opin. Genet. Develop. 3:102-109. 

In addition, a number of illustrative adenovirus-based systems have also 
been described. Unlike retroviruses which integrate into the host genome, adenoviruses 
persist extrachromosomally thus minimizing the risks associated with insertional 

30 mutagenesis (Haj-Ahmad and Graham (1986) J. Virol. 57:267-274; Bett et al. (1993) J. 
Virol. 67:5911-5921; Mittereder et al. (1994) Human Gene Therapy 5:717-729; Seth et 
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al. (1994) J. Virol. 68:933-940; Barr et al. (1994) Gene Therapy 1:51-58; Berkner, K. L. 
(1988) BioTechniques 6:616-629; and Rich et al. (1993) Human Gene Therapy 4:461- 
476). 

Various adeno-associated virus (AAV) vector systems have also been 
5 developed for polynucleotide delivery. AAV vectors can be readily constructed using 
techniques well known in the art. See, e.g., U.S. Pat. Nos. 5,173,414 and 5,139,941; 
International Publication Nos. WO 92/01070 and WO 93/03769; Lebkowski et al. 
(1988) Molec. Cell. Biol. 8:3988-3996; Vincent et al. (1990) Vaccines 90 (Cold Spring 
Harbor Laboratory Press); Carter, B. J. (1992) Current Opinion in Biotechnology 3:533- 

10 539; Muzyczka, N. (1992) Current Topics in Microbiol, and Immunol. 158:97-129; 
Kotin, R. M. (1994) Human Gene Therapy 5:793-801; Shelling and Smith (1994) Gene 
Therapy 1:165-169; and Zhou et al. (1994) J. Exp. Med. 179:1867-1875. 

Additional viral vectors useful for delivering the polynucleotides 
encoding polypeptides of the present invention by gene transfer include those derived 

15 from the pox family of viruses, such as vaccinia virus and avian poxvirus. By way of 
example, vaccinia virus recombinants expressing the novel molecules can be 
constructed as follows. The DNA encoding a polypeptide is first inserted into an 
appropriate vector so that it is adjacent to a vaccinia promoter and flanking vaccinia 
DNA sequences, such as the sequence encoding thymidine kinase (TK). This vector is 

20 then used to transfect cells which are simultaneously infected with vaccinia. 
Homologous recombination serves to insert the vaccinia promoter plus the gene 
encoding the polypeptide of interest into the viral genome. The resulting TK.sup.(-) 
recombinant can be selected by culturing the cells in the presence of 5- 
bromodeoxyuridine and picking viral plaques resistant thereto. 

25 A vaccinia-based infection/transfection system can be conveniently used 

to provide for inducible, transient expression or coexpression of one or more 
polypeptides described herein in host cells of an organism. In this particular system, 
cells are first infected in vitro with a vaccinia virus recombinant that encodes the 
bacteriophage T7 RNA polymerase. This polymerase displays exquisite specificity in 

30 that it only transcribes templates bearing T7 promoters. Following infection, cells are 
transfected with the polynucleotide or polynucleotides of interest, driven by a T7 
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promoter. The polymerase expressed in the cytoplasm from the vaccinia virus 
recombinant transcribes the transfected DNA into RNA which is then translated into 
polypeptide by the host translational machinery. The method provides for high level, 
transient, cytoplasmic production of large quantities of RNA and its translation 
5 products. See, e.g. , Elroy-Stein and Moss, Proc. Natl. Acad. Sci. USA (1990) 87:6743- 
6747; Fuerst et al. Proc. Natl. Acad. Sci. USA (1986) 83:8122-8126. 

Alternatively, avipoxviruses, such as the fowlpox and canarypox viruses, 
can also be used to deliver the coding sequences of interest. Recombinant avipox 
viruses, expressing immunogens from mammalian pathogens, are known to confer 

10 protective immunity when administered to non-avian species. The use of an Avipox 
vector is particularly desirable in human and other mammalian species since members 
of the Avipox genus can only productively replicate in susceptible avian species and 
therefore are not infective in mammalian cells. Methods for producing recombinant 
Avipoxviruses are known in the art and employ genetic recombination, as described 

15 above with respect to the production of vaccinia viruses. See, e.g., WO 91/12882; WO 
89/03429; and WO 92/03545. 

Any of a number of alphavirus vectors can also be used for delivery of 
polynucleotide compositions of the present invention, such as those vectors described in 
U.S. Patent Nos. 5,843,723; 6,015,686; 6,008,035 and 6,015,694. Certain vectors based 

20 on Venezuelan Equine Encephalitis (VEE) can also be used, illustrative examples of 
which can be found in U.S. Patent Nos. 5,505,947 and 5,643,576. 

Moreover, molecular conjugate vectors, such as the adenovirus chimeric 
vectors described in Michael et al. J. Biol. Chem. (1993) 268:6866-6869 and Wagner et 
al. Proc. Natl. Acad. Sci. USA (1992) 89:6099-6103, can also be used for gene delivery 

25 under the invention. 

Additional illustrative information on these and other known viral-based 
delivery systems can be found, for example, in Fisher-Hoch et al., Proc. Natl. Acad. Sci. 
USA 5(5:317-321, 1989; Flexner et al., Ann. NY. Acad. Sci. 569:86-103, 1989; Flexner 
et al, Vaccine 5:17-21, 1990; U.S. Patent Nos. 4,603,112, 4,769,330, and 5,017,487; 

30 WO 89/01973; U.S. Patent No. 4,777,127; GB 2,200,651; EP 0,345,242; 
WO 91/02805; Berkner, Biotechniques (5:616-627, 1988; Rosenfeld et al., Science 
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252:431-434, 1991; Kolls et al., Proc. Natl. Acad. Sci. USA Pi:215-219, 1994; 
Kass-Eisler et al., Proc. Natl Acad. Sci. USA 90: 11498-11502, 1993; Guzman et al., 
Circulation 55:2838-2848, 1993; and Guzman et al., Cir. Res. 75:1202-1207, 1993. 

In certain embodiments, a polynucleotide may be integrated into the 
5 genome of a target cell. This integration may be in the specific location and orientation 
via homologous recombination (gene replacement) or it may be integrated in a random, 
non-specific location (gene augmentation). In yet further embodiments, the 
polynucleotide may be stably maintained in the cell as a separate, episomal segment of 
DNA. Such polynucleotide segments or "episomes" encode sequences sufficient to 

10 permit maintenance and replication independent of or in synchronization with the host 
cell cycle. The manner in which the expression construct is delivered to a cell and 
where in the cell the polynucleotide remains is dependent on the type of expression 
construct employed. 

In another embodiment of the invention, a polynucleotide is 

15 administered/delivered as "naked" DNA, for example as described in Ulmer et al., 
Science 259: 1745- 1749, 1993 and reviewed by Cohen, Science 259: 1691-1 692, 1993. 
The uptake of naked DNA may be increased by coating the DNA onto biodegradable 
beads, which are efficiently transported into the cells. 

In still another embodiment, a composition of the present invention can 

20 be delivered via a particle bombardment approach, many of which have been described. 
In one illustrative example, gas-driven particle acceleration can be achieved with 
devices such as those manufactured by Powderject Pharmaceuticals PLC (Oxford, UK) 
and Powderject Vaccines Inc. (Madison, WI), some examples of which are described in 
U.S. Patent Nos. 5,846,796; 6,010,478; 5,865,796; 5,584,807; and EP Patent No. 0500 

25 799. This approach offers a needle-free delivery approach wherein a dry powder 
formulation of microscopic particles, such as polynucleotide or polypeptide particles, 
are accelerated to high speed within a helium gas jet generated by a hand held device, 
propelling the particles into a target tissue of interest. 

In a related embodiment, other devices and methods that may be useful 

30 for gas-driven needle-less injection of compositions of the present invention include 
those provided by Bioject, Inc. (Portland, OR), some examples of which are described 
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in U.S. Patent Nos. 4,790,824; 5,064,413; 5,312,335; 5,383,851; 5,399,163; 5,520,639 
and 5,993,412. 

According to another embodiment, the pharmaceutical compositions 
described herein will comprise one or more immunostimulants in addition to the 
5 immunogenic polynucleotide, polypeptide, antibody, T-cell and/or APC compositions 
of this invention. An immunostimulant refers to essentially any substance that enhances 
or potentiates an immune response (antibody and/or cell-mediated) to an exogenous 
antigen. One preferred type of immunostimulant comprises an adjuvant. Many 
adjuvants contain a substance designed to protect the antigen from rapid catabolism, 

10 such as aluminum hydroxide or mineral oil, and a stimulator of immune responses, such 
as lipid A, Bortadella pertussis or Mycobacterium tuberculosis derived proteins. 
Certain adjuvants are commercially available as, for example, Freund's Incomplete 
Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, MI); Merck Adjuvant 
65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKline Beecham, Philadelphia, 

15 PA); aluminum salts such as aluminum hydroxide gel (alum) or aluminum phosphate; 
salts of calcium, iron or zinc; an insoluble suspension of acylated tyrosine; acylated 
sugars; cationically or anionically derivatized polysaccharides; polyphosphazenes; 
biodegradable microspheres; monophosphoryl lipid A and quil A. Cytokines, such as 
GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be used as 

20 adjuvants. 

Within certain embodiments of the invention, the adjuvant composition 
is preferably one that induces an immune response predominantly of the Thl type. High 
levels of Thl-type cytokines {e.g., IFN-y, TNFa, IL-2 and IL-12) tend to favor the 
induction of cell mediated immune responses to an administered antigen. In contrast, 

25 high levels of Th2-type cytokines (e.g., IL-4, IL-5, IL-6 and IL-10) tend to favor the 
induction of humoral immune responses. Following application of a vaccine as 
provided herein, a patient will support an immune response that includes Thl- and Tb2- 
type responses. Within a preferred embodiment, in winch a response is predominantly 
Thl-type, the level of Thl-type cytokines will increase to a greater extent than the level 

30 of Th2-type cytokines. The levels of these cytokines may be readily assessed using 
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standard assays. For a review of the families of cytokines, see Mosmann and Coffman, 
Ann. Rev. Immunol 7:145-173, 1989. 

Certain preferred adjuvants for eliciting a predominantly Thl-type 
response include, for example, a combination of monophosphoryl lipid A, preferably 3- 
5 de-O-acylated monophosphoryl lipid A, together with an aluminum salt. MPL® 
adjuvants are available from Corixa Corporation (Seattle, WA; see, for example, US 
Patent Nos. 4,436,727; 4,877,611; 4,866,034 and 4,912,094). CpG-containing 
oligonucleotides (in which the CpG dinucleotide is unmethylated) also induce a 
predominantly Thl response. Such oligonucleotides are well known and are described, 

10 for example, in WO 96/02555, WO 99/33488 and U.S. Patent Nos. 6,008,200 and 
5,856,462. Immunostimulatory DNA sequences are also described, for example, by 
Sato et al., Science 273:352, 1996. Another preferred adjuvant comprises a saponin, 
such as Quil A, or derivatives thereof, including QS21 and QS7 (Aquila 
Biopharmaceuticals Inc., Framingham, MA); Escin; Digitonin; or Gypsophila or 

15 Chenopodium quinoa saponins . Other preferred formulations include more than one 
saponin in the adjuvant combinations of the present invention, for example 
combinations of at least two of the following group comprising QS21, QS7, Quil A, (3- 
escin, or digitonin. 

Alternatively the saponin formulations may be combined with vaccine 

20 vehicles composed of chitosan or other polycationic polymers, polylactide and 
polylactide-co-glycolide particles, poly-N-acetyl glucosamine-based polymer matrix, 
particles composed of polysaccharides or chemically modified polysaccharides, 
liposomes and lipid-based particles, particles composed of glycerol monoesters, etc. The 
saponins may also be formulated in the presence of cholesterol to form particulate 

25 structures such as liposomes or ISCOMs. Furthermore, the saponins may be formulated 
together with a polyoxyethylene ether or ester, in either a non-particulate solution or 
suspension, or in a particulate structure such as a paucilamelar liposome or ISCOM. The 
saponins may also be formulated with excipients such as Carbopol R to increase 
viscosity, or may be formulated in a dry powder form with a powder excipient such as 

30 lactose. 
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In one preferred embodiment, the adjuvant system includes the 
combination of a monophosphoryl lipid A and a saponin derivative, such as the 
combination of QS21 and 3D-MPL® adjuvant, as described in WO 94/00153, or a less 
reactogenic composition where the QS21 is quenched with cholesterol, as described in 
5 WO 96/33739. Other preferred formulations comprise an oil-in- water emulsion and 
tocopherol. Another particularly preferred adjuvant formulation employing QS21, 3D- 
MPL® adjuvant and tocopherol in an oil-in-water emulsion is described in WO 
95/17210. 

Another enhanced adjuvant system involves the combination of a CpG- 
10 containing oligonucleotide and a saponin derivative particularly the combination of 
CpG and QS21 is disclosed in WO 00/09159. Preferably the formulation additionally 
comprises an oil in water emulsion and tocopherol. 

Additional illustrative adjuvants for use in the pharmaceutical 
compositions of the invention include Montanide ISA 720 (Seppic, France), SAF 
15 (Chiron, California, United States), ISCOMS (CSL), MF-59 (Chiron), the SB AS series 
of adjuvants {e.g., SBAS-2 or SBAS-4, available from SmithKline Beecham, Rixensart, 
Belgium), Detox (Enhanzyn®) (Corixa, Hamilton, MT), RC-529 (Corixa, Hamilton, 
MT) and other aminoalkyl glucosaminide 4-phosphates (AGPs), such as those described 
in pending U.S. Patent Application Serial Nos. 08/853,826 and 09/074,720, the 
20 disclosures of which are incorporated herein by reference in their entireties, and 
polyoxyethylene ether adjuvants such as those described in WO 99/52549A1. 

Other preferred adjuvants include adjuvant molecules of the general 

formula 

(I): HO(CH 2 CH 2 0)„-A-R, 
25 wherein, n is 1-50, A is a bond or -C(O)-, R is C1-50 alkyl or Phenyl C1-50 alkyl. 

One embodiment of the present invention consists of a vaccine 
formulation comprising a polyoxyethylene ether of general formula (I), wherein n is 
between 1 and 50, preferably 4-24, most preferably 9; the R component is Ci_ 5 o, 
preferably C4-C20 alkyl and most preferably Cn alkyl, and i is a bond. The 
30 concentration of the polyoxyethylene ethers should be in the range 0.1-20%, preferably 
from 0.1-10%, and most preferably in the range 0.1-1%. Preferred polyoxyethylene 
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ethers are selected from the following group: polyoxyethylene-9-lauryl ether, 
polyoxyethylene-9-steoryl ether, polyoxyethylene-8-steoryl ether, polyoxyethylene-4- 
lauryl ether, polyoxyethylene-35-lauryl ether, and polyoxyethylene-23-lauryl ether. 
Polyoxyethylene ethers such as polyoxyethylene lauryl ether are described in the Merck 
5 index (12 th edition: entry 7717). These adjuvant molecules are described in WO 
99/52549. 

The polyoxyethylene ether according to the general formula (I) above 
may, if desired, be combined with another adjuvant. For example, a preferred adjuvant 
combination is preferably with CpG as described in the pending UK patent application 

10 GB 9820956.2. 

According to another embodiment of this invention, an immunogenic 
composition described herein is delivered to a host via antigen presenting cells (APCs), 
such as dendritic cells, macrophages, B cells, monocytes and other cells that may be 
engineered to be efficient APCs. Such cells may, but need not, be genetically modified 

15 to increase the capacity for presenting the antigen, to improve activation and/or 
maintenance of the T cell response, to have anti-tumor effects per se and/or to be 
immunologically compatible with the receiver (i.e., matched HLA haplotype). APCs 
may generally be isolated from any of a variety of biological fluids and organs, 
including tumor and peritumoral tissues, and may be autologous, allogeneic, syngeneic 

20 or xenogeneic cells. 

Certain preferred embodiments of the present invention use dendritic 
cells or progenitors thereof as antigen-presenting cells. Dendritic cells are highly potent 
APCs (Banchereau and Steinman, Nature 392:245-251, 1998) and have been shown to 
be effective as a physiological adjuvant for eliciting prophylactic or therapeutic 

25 antitumor immunity (see Timmerman and Levy, Ann. Rev. Med. 5(9:507-529, 1999). In 
general, dendritic cells may be identified based on their typical shape (stellate in situ, 
with marked cytoplasmic processes (dendrites) visible in vitro), their ability to take up, 
process and present antigens with high efficiency and their ability to activate naive T 
cell responses. Dendritic cells may, of course, be engineered to express specific cell- 

30 surface receptors or ligands that are not commonly found on dendritic cells in vivo or ex 
vivo, and such modified dendritic cells are contemplated by the present invention. As 
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an alternative to dendritic cells, secreted vesicles antigen-loaded dendritic cells (called 
exosomes) may be used within a vaccine {see Zitvogel et al., Nature Med. 4:594-600, 
1998). 

Dendritic cells and progenitors may be obtained from peripheral blood, 
5 bone marrow, tumor-infiltrating cells, peritumoral tissues-infiltrating cells, lymph 
nodes, spleen, skin, umbilical cord blood or any other suitable tissue or fluid. For 
example, dendritic cells may be differentiated ex vivo by adding a combination of 
cytokines such as GM-CSF, IL-4, IL-13 and/or TNFa to cultures of monocytes 
harvested from peripheral blood. Alternatively, CD34 positive cells harvested from 

10 peripheral blood, umbilical cord blood or bone marrow may be differentiated into 
dendritic cells by adding to the culture medium combinations of GM-CSF, IL-3, TNFa, 
CD40 ligand, LPS, flt3 ligand and/or other compound(s) that induce differentiation, 
maturation and proliferation of dendritic cells. 

Dendritic cells are conveniently categorized as "immature" and "mature" 

15 cells, which allows a simple way to discriminate between two well characterized 
phenotypes. However, this nomenclature should not be construed to exclude all 
possible intermediate stages of differentiation. Immature dendritic cells are 
characterized as APC with a high capacity for antigen uptake and processing, which 
correlates with the high expression of Fey receptor and marmose receptor. The mate 

20 phenotype is typically characterized by a lower expression of these markers, but a high 
expression of cell surface molecules responsible for T cell activation such as class I and 
class II MHC, adhesion molecules {e.g., CD54 and CD 11) and costimulatory molecules 
{e.g., CD40, CD80, CD86 and 4-1BB). 

APCs may generally be transfected with a polynucleotide of the 

25 invention (or portion or other variant thereof) such that the encoded polypeptide, or an 
immunogenic portion thereof, is expressed on the cell surface. Such transfection may 
take place ex vivo, and a pharmaceutical composition comprising such transfected cells 
may then be used for therapeutic purposes, as described herein. Alternatively, a gene 
delivery vehicle that targets a dendritic or other antigen presenting cell may be 

30 administered to a patient, resulting in transfection that occurs in vivo. In vivo and ex 
vivo transfection of dendritic cells, for example, may generally be performed using any 
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methods known in the art, such as those described in WO 97/24447, or the gene gun 
approach described by Mahvi et al., Immunology and cell Biology 75:456-460, 1997. 
Antigen loading of dendritic cells may be achieved by incubating dendritic cells or 
progenitor cells with the tumor polypeptide, DNA (naked or within a plasmid vector) or 
5 RNA; or with antigen-expressing recombinant bacterium or viruses {e.g., vaccinia, 
fowlpox, adenovirus or lentivirus vectors). Prior to loading, the polypeptide may be 
covalently conjugated to an immunological partner that provides T cell help {e.g., a 
carrier molecule). Alternatively, a dendritic cell may be pulsed with a non-conjugated 
immunological partner, separately or in the presence of the polypeptide. 

10 While any suitable carrier known to those of ordinary skill in the art may 

be employed in the pharmaceutical compositions of this invention, the type of carrier 
will typically vary depending on the mode of administration. Compositions of the 
present invention may be formulated for any appropriate manner of administration, 
including for example, topical, oral, nasal, mucosal, intravenous, intracranial, 

1 5 intraperitoneal, subcutaneous and intramuscular administration. 

Carriers for use within such pharmaceutical compositions are 
biocompatible, and may also be biodegradable. In certain embodiments, the 
formulation preferably provides a relatively constant level of active component release. 
In other embodiments, however, a more rapid rate of release immediately -upon 

20 administration may be desired. The formulation of such compositions is well within the 
level of ordinary skill in the art using known techniques. Illustrative carriers useful in 
this regard include microparticles of poly(lactide-co-glycolide), polyacrylate, latex, 
starch, cellulose, dextran and the like. Other illustrative delayed-release carriers 
include supramolecular biovectors, which comprise a non-liquid hydrophilic core {e.g., 

25 a cross-linked polysaccharide or oligosaccharide) and, optionally, an external layer 
comprising an amphiphilic compound, such as a phospholipid {see e.g., U.S. Patent No. 
5,151,254 and PCT applications WO 94/20078, WO/94/23701 and WO 96/06638). The 
amount of active compound contained within a sustained release formulation depends 
upon the site of implantation, the rate and expected duration of release and the nature of 

30 the condition to be treated or prevented. 
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In another illustrative embodiment, biodegradable microspheres (e.g., 
polylactate polyglycolate) are employed as carriers for the compositions of this 
invention. Suitable biodegradable microspheres are disclosed, for example, in U.S. 
Patent Nos. 4,897,268; 5,075,109; 5,928,647; 5,811,128; 5,820,883; 5,853,763; 
5 5,814,344, 5,407,609 and 5,942,252. Modified hepatitis B core protein carrier systems, 
such as described in WO/99 40934, and references cited therein, will also be useful for 
many applications. Another illustrative carrier/delivery system employs a carrier 
comprising particulate-protein complexes, such as those described in U.S. Patent No. 
5,928,647, which are capable of inducing a class I-restricted cytotoxic T lymphocyte 

1 0 responses in a host. 

The pharmaceutical compositions of the invention will often further 
comprise one or more buffers (e.g., neutral buffered saline or phosphate buffered 
saline), carbohydrates (e.g., glucose, mannose, sucrose or dextrans), mannitol, proteins, 
polypeptides or amino acids such as glycine, antioxidants, bacteriostats, chelating 

15 agents such as EDTA or glutathione, adjuvants (e.g., aluminum hydroxide), solutes that 
render the formulation isotonic, hypotonic or weakly hypertonic with the blood of a 
recipient, suspending agents, thickening agents and/or preservatives. Alternatively, 
compositions of the present invention may be formulated as a lyophilizate. 

The pharmaceutical compositions described herein may be presented in 

20 unit-dose or multi-dose containers, such as sealed ampoules or vials. Such containers 
are typically sealed in such a way to preserve the sterility and stability of the 
formulation until use. In general, formulations may be stored as suspensions, solutions 
or emulsions in oily or aqueous vehicles. Alternatively, a pharmaceutical composition 
may be stored in a freeze-dried condition requiring only the addition of a sterile liquid 

25 carrier immediately prior to use. 

The development of suitable dosing and treatment regimens for using the 
particular compositions described herein in a variety of treatment regimens, including 
e.g., oral, parenteral, intravenous, intranasal, and intramuscular administration and 
formulation, is well known in the art, some of which are briefly discussed below for 

30 general purposes of illustration. 
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In certain applications, the pharmaceutical compositions disclosed herein 
may be delivered via oral administration to an animal. As such, these compositions 
may be formulated with an inert diluent or with an assimilable edible carrier, or they 
may be enclosed in hard- or soft-shell gelatin capsule, or they may be compressed into 
5 tablets, or they may be incorporated directly with the food of the diet. „ 

The active compounds may even be incorporated with excipients and 
used in the form of ingestible tablets, buccal tables, troches, capsules, elixirs, 
suspensions, syrups, wafers, and the like (see, for example, Mathiowitz et ah, Nature 
1997 Mar 27;386(6623):410-4; Hwang et a!., Crit Rev Ther Drug Carrier Syst 

10 1998;15(3):243-84; U. S. Patent 5,641,515; U. S. Patent 5,580,579 and U. S. Patent 
5,792,451). Tablets, troches, pills, capsules and the like may also contain any of a 
variety of additional components, for example, a binder, such as gum tragacanth, acacia, 
cornstarch, or gelatin; excipients, such as dicalcium phosphate; a disintegrating agent, 
such as corn starch, potato starch, alginic acid and the like; a lubricant, such as 

15 magnesium stearate; and a sweetening agent, such as sucrose, lactose or saccharin may 
be added or a flavoring agent, such as peppermint, oil of wintergreen, or cherry 
flavoring. When the dosage unit form is a capsule, it may contain, in addition to 
materials of the above type, a liquid carrier. Various other materials may be present as 
coatings or to otherwise modify the physical form of the dosage unit. For instance, 

20 tablets, pills, or capsules may be coated with shellac, sugar, or both. Of course, any 
material used in preparing any dosage unit form should be pharmaceutically pure and 
substantially non-toxic in the amounts employed. In addition, the active compounds 
may be incorporated into sustained-release preparation and formulations. 

Typically, these formulations will contain at least about 0.1% of the 

25 active compound or more, although the percentage of the active ingredient(s) may, of 
course, be varied and may conveniently be between about 1 or 2% and about 60% or 
70% or more of the weight or volume of the total formulation. Naturally, the amount of 
active compound(s) in each therapeutically useful composition may be prepared is such 
a way that a suitable dosage will be obtained in any given unit dose of the compound. 

30 Factors such as solubility, bioavailability, biological half-life, route of administration, 
product shelf life, as well as other pharmacological considerations will be contemplated 
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by one skilled in the art of preparing such pharmaceutical formulations, and as such, a 
variety of dosages and treatment regimens may be desirable. 

For oral administration the compositions of the present invention may 
alternatively be incorporated with one or more excipients in the form of a mouthwash, 
5 dentifrice, buccal tablet, oral spray, or sublingual orally-administered formulation. 
Alternatively, the active ingredient may be incorporated into an oral solution such as 
one containing sodium borate, glycerin and potassium bicarbonate, or dispersed in a 
dentifrice, or added in a therapeutically-effective amount to a composition that may 
include water, binders, abrasives, flavoring agents, foaming agents, and humectants. 

10 Alternatively the compositions may be fashioned into a tablet or solution form that may 
be placed under the tongue or otherwise dissolved in the mouth. 

In certain circumstances it will be desirable to deliver the pharmaceutical 
compositions disclosed herein parenterally, intravenously, intramuscularly, or even 
intraperitoneally. Such approaches are well known to the skilled artisan, some of which 

15 are further described, for example, in U. S. Patent 5,543,158; U. S. Patent 5,641,515 
and U. S. Patent 5,399,363. In certain embodiments, solutions of the active compounds 
as free base or pharmacologically acceptable salts may be prepared in water suitably 
mixed with a surfactant, such as hydroxypropylcellulose. Dispersions may also be 
prepared in glycerol, liquid polyethylene glycols, and mixtures thereof and in oils. 

20 Under ordinary conditions of storage and use, these preparations generally will contain a 
preservative to prevent the growth of microorganisms. 

Illustrative pharmaceutical forms suitable for injectable use include 
sterile aqueous solutions or dispersions and sterile powders for the extemporaneous 
preparation of sterile injectable solutions or dispersions (for example, see U. S. Patent 

25 5,466,468). In all cases the form must be sterile and must be fluid to the extent that 
easy syringability exists. It must be stable under the conditions of manufacture and 
storage and must be preserved against the contaminating action of microorganisms, 
such as bacteria and fungi. The carrier can be a solvent or dispersion medium 
containing, for example, water, ethanol, polyol (e.g., glycerol, propylene glycol, and 

30 liquid polyethylene glycol, and the like), suitable mixtures thereof, and/or vegetable 
oils. Proper fluidity may be maintained, for example, by the use of a coating, such as 
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lecithin, by the maintenance of the required particle size in the case of dispersion and/or 
by the use of surfactants. The prevention of the action of microorganisms can be 
facilitated by various antibacterial and antifungal agents, for example, parabens, 
chlorobutanol, phenol, sorbic acid, thimerosal, and the like. In many cases, it will be 
5 preferable to include isotonic agents, for example, sugars or sodium chloride. 
Prolonged absorption of the injectable compositions can be brought about by the use in 
the compositions of agents delaying absorption, for example, aluminum monostearate 
and gelatin. 

In one embodiment, for parenteral administration in an aqueous solution, 

10 the solution should be suitably buffered if necessary and the liquid diluent first rendered 
isotonic with sufficient saline or glucose. These particular aqueous solutions are 
especially suitable for intravenous, intramuscular, subcutaneous and intraperitoneal 
administration. In this connection, a sterile aqueous medium that can be employed will 
be known to those of skill in the art in light of the present disclosure. For example, one 

15 dosage may be dissolved in 1 ml of isotonic NaCl solution and either added to 1000 ml 
of hypodermoclysis fluid or injected at the proposed site of infusion, (see for example, 
"Remington's Pharmaceutical Sciences" 15th Edition, pages 1035-1038 and 1570- 
1580). Some variation in dosage will necessarily occur depending on the condition of 
the subject being treated. Moreover, for human administration, preparations will of 

20 course preferably meet sterility, pyrogenicity, and the general safety and purity 
standards as required by FDA Office of Biologies standards. 

In another embodiment of the invention, the compositions disclosed 
herein may be formulated in a neutral or salt form.' Illustrative 
pharmaceutically-acceptable salts include the acid addition salts (formed with the free 

25 amino groups of the protein) and which are formed with inorganic acids such as, for 
example, hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, 
tartaric, mandelic, and the like. Salts formed with the free carboxyl groups can also be 
derived from inorganic bases such as, for example, sodium, potassium, ammonium, 
calcium, or ferric hydroxides, and such organic bases as isopropylamine, 

30 trimethylamine, histidine, procaine and the like. Upon formulation, solutions will be 
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administered in a manner compatible with the dosage formulation and in such amount 
as is therapeutically effective. 

The carriers can further comprise any and all solvents, dispersion media, 
vehicles, coatings, diluents, antibacterial and antifungal agents, isotonic and absorption 
5 delaying agents, buffers, carrier solutions, suspensions, colloids, and the like. The use 
of such media and agents for pharmaceutical active substances is well known in the art. 
Except insofar as any conventional media or agent is incompatible with the active 
ingredient, its use in the therapeutic compositions is contemplated. Supplementary 
active ingredients can also be incorporated into the compositions. The phrase 

10 "pharmaceutically-acceptable" refers to molecular entities and compositions that do not 
produce an allergic or similar untoward reaction when administered to a human. 

In certain embodiments, the pharmaceutical compositions may be 
delivered by intranasal sprays, inhalation, and/or other aerosol delivery vehicles. 
Methods for delivering genes, nucleic acids, and peptide compositions directly to the 

15 lungs via nasal aerosol sprays has been described, e.g., in U. S. Patent 5,756,353 and U. 
S. Patent 5,804,212. Likewise, the delivery of drugs using intranasal thicroparticle 
resins (Takenaga et ah, J Controlled Release 1998 Mar 2;52(l-2):81-7) and 
lysophosphatidyl-glycerol compounds (U. S. Patent 5,725,871) are also well-known in 
the pharmaceutical arts. Likewise, illustrative transmucosal drug delivery in the form of 

20 a polytefrafluoroetheylene support matrix is described in U. S. Patent 5,780,045. 

In certain embodiments, liposomes, nanocapsules, microparticles, lipid 
particles, vesicles, and the like, are used for the introduction of the compositions of the 
present invention into suitable host cells/organisms. In particular, the compositions of 
the present invention may be formulated for delivery either encapsulated in a lipid 

25 particle, a liposome, a vesicle, a nanosphere, or a nanoparticle or the like. Alternatively, 
compositions of the present invention can be bound, either covalently or non-covalently, 
to the surface of such carrier vehicles. 

The formation and use of liposome and liposome-like preparations as 
potential drug carriers is generally known to those of skill in the art (see for example, 

30 Lasic, Trends Biotechnol 1998 Jul;16(7):307-21; Takakura, Nippon Rinsho 1998 
Mar;56(3):691-5; Chandran et al, Indian J Exp Biol. 1997 Aug;35(8):801-9; Margalit, 
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Grit Rev Ther Drug Carrier Syst. 1995;12(2-3):233-61; U.S. Patent 5,567,434; U.S. 
Patent 5,552,157; U.S. Patent 5,565,213; U.S. Patent 5,738,868 and U.S. Patent 
5,795,587, each specifically incorporated herein by reference in its entirety). 

Liposomes have been used successfully with a number of cell types that 
5 are normally difficult to transfect by other procedures, including T cell suspensions, 
primary hepatocyte cultures and PC 12 cells (Renneisen et al, J Biol Chem. 1990 Sep 
25;265(27):16337-42; Muller et al, DNA Cell Biol. 1990 Apr;9(3):221-9). In addition, 
liposomes are free of the DNA length constraints that are typical of viral-based delivery 
systems. Liposomes have been used effectively to introduce genes, various drugs, 

10 radiotherapeutic agents, enzymes, viruses, transcription factors, allosteric effectors and 
the like, into a variety of cultured cell lines and animals. Furthermore, he use of 
liposomes does not appear to be associated with autoimmune responses or unacceptable 
toxicity after systemic delivery. 

In certain embodiments, liposomes are formed from phospholipids that 

15 are dispersed in an aqueous medium and spontaneously form multilamellar concentric 
bilayer vesicles (also termed multilamellar vesicles (MLVs). 

Alternatively, in other embodiments, the invention provides for 
pharmaceutically-acceptable nanocapsule formulations of the compositions of the 
present invention. Nanocapsules can generally entrap compounds in a stable and 

20 reproducible way (see, for example, Quintanar-Guerrero et al., Drug Dev Ind Pharm. 
1998 Dec;24(12):l 113-28). To avoid side effects due to intracellular polymeric 
overloading, such ultrafine particles (sized around 0.1 um) may be designed using 
polymers able to be degraded in vivo. Such particles can be made as described, for 
example, by Couvreur et al, Crit Rev Ther Drug Carrier Syst. 1988;5(l):l-20; zur 

25 Muhlen et al, Eur J Pharm Biopharm. 1998 Mar;45(2):149-55; Zambaux et al. J 
Controlled Release. 1998 Jan 2;50(l-3):3 1-40; andU. S. Patent 5,145,684. 

Cancer Therapeutic Methods 

Immunologic approaches to cancer therapy are based on the recognition 
that cancer cells can often evade the body's defenses against aberrant or foreign cells 
30 and molecules, and that these defenses might be therapeutically stimulated to regain the 
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lost ground, e.g. pgs. 623-648 in Klein, Immunology (Wiley-Interscience, New York, 
1982). Numerous recent observations that various immune effectors can directly or 
indirectly inhibit growth of tumors has led to renewed interest in this approach to cancer 
therapy, e.g. Jager, et al., Oncology 2001;60(l):l-7; Renner, et al., Ann Hematol 2000 
5 Dec;79(12):651-9. 

Four-basic cell types whose function has been associated with antitumor 
cell immunity and the elimination of tumor cells from the body are: i) B-lymphocytes 
which secrete immunoglobulins into the blood plasma for identifying and labeling the 
nonself invader cells; ii) monocytes which secrete the complement proteins that are 

10 responsible for lysing and processing the immunoglobulin-coated target invader cells; 
iii) natural killer lymphocytes having two mechanisms for the destruction of tumor 
cells, antibody-dependent cellular cytotoxicity and natural killing; and iv) T- 
lymphocytes possessing antigen-specific receptors and having the capacity to recognize 
a tumor cell carrying complementary marker molecules (Schreiber, H., 1989, in 

1 5 Fundamental Immunology (ed). W. E. Paul, pp. 923-955). 

Cancer immunotherapy generally focuses on inducing humoral immune 
responses, cellular immune responses, or both. Moreover, it is well established that 
induction of CD4 + T helper cells is necessary in order to secondarily induce either 
antibodies or cytotoxic CD8 + T cells. Polypeptide antigens that are selective or ideally 

20 specific for cancer cells, particularly lung cancer cells, offer a powerful approach for 
inducing immune responses against lung cancer, and are an important aspect of the 
present invention. 

Therefore, in further aspects of the present invention, the pharmaceutical 
compositions described herein may be used for the treatment of cancer, particularly for 

25 the immunotherapy of lung cancer. Within such methods, the pharmaceutical 
compositions described herein are administered to a patient, typically a warm-blooded 
animal, preferably a human. A patient may or may not be afflicted with cancer. 
Accordingly, the above pharmaceutical compositions may be used to prevent the 
development of a cancer or to treat a patient afflicted with a cancer. Pharmaceutical 

30 compositions and vaccines may be administered either prior to or following surgical 
removal of primary tumors and/or treatment such as administration of radiotherapy or 
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conventional chemotherapeutic drugs. As discussed above, administration of the 
pharmaceutical compositions may be by any suitable method, including administration 
by intravenous, intraperitoneal, intramuscular, subcutaneous, intranasal, intradermal, 
anal, vaginal, topical and oral routes. 
5 Within certain embodiments, immunotherapy may be active 

immunotherapy, in which treatment relies on the in vivo stimulation of the endogenous 
host immune system to react against tumors with the administration of immune 
response-modifying agents (such as polypeptides and polynucleotides as provided 
herein). 

10 Within other embodiments, immunotherapy may be passive 

immunotherapy, in which treatment involves the delivery of agents with established 
tumor-immune reactivity (such as effector cells or antibodies) that can directly or 
indirectly mediate antitumor effects and does not necessarily depend on an intact host 
immune system. Examples of effector cells include T cells as discussed above, T 

15 lymphocytes (such as CD8 + cytotoxic T lymphocytes and CD4 + T-helper tumor- 
infiltrating lymphocytes), killer cells (such as Natural Killer cells and lymphokine- 
activated killer cells), B cells and antigen-presenting cells (such as dendritic cells and 
macrophages) expressing a polypeptide provided herein. T cell receptors and antibody 
receptors specific for the polypeptides recited herein may be cloned, expressed and 

20 transferred into other vectors or effector cells for adoptive immunotherapy. The 
polypeptides provided herein may also be used to generate antibodies or anti-idiotypic 
antibodies (as described above and in U.S. Patent No. 4,918,164) for passive 
immunotherapy. 

Monoclonal antibodies may be labeled with any of a variety of labels for 
25 desired selective usages in detection, diagnostic assays or therapeutic applications (as 
described in U.S. Patent Nos. 6,090,365; 6,015,542; 5,843,398; 5,595,721; and 
4,708,930, hereby incorporated by reference in their entirety as if each was incorporated 
individually). In each case, the binding of the labelled monoclonal antibody to the 
determinant site of the antigen will signal detection or delivery of a particular 
30 therapeutic agent to the antigenic determinant on the non-normal cell. A further object 
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of this invention is to provide the specific monoclonal antibody suitably labelled for 
achieving such desired selective usages thereof. 

Effector cells may generally be obtained in sufficient quantities for 
adoptive immunotherapy by growth in vitro, as described herein. Culture conditions for 
5 expanding single antigen-specific effector cells to several billion in number with 
retention of antigen recognition in vivo are well known in the art. Such in vitro culture 
conditions typically use intermittent stimulation with antigen, often in the presence of 
cytokines (such as IL-2) and non-dividing feeder cells. As noted above, 
immunoreactive polypeptides as provided herein may be used to rapidly expand 

10 antigen-specific T cell cultures in order to generate a sufficient number of cells for 
immunotherapy. In particular, antigen-presenting cells, such as dendritic, macrophage, 
monocyte, fibroblast and/or B cells, may be pulsed with immunoreactive polypeptides 
or transfected with one or more polynucleotides using standard techniques well known 
in the art. For example, antigen-presenting cells can be transfected with a 

15 polynucleotide having a promoter appropriate for increasing expression in a 
recombinant virus or other expression system. Cultured effector cells for use in therapy 
must be able to grow and distribute widely, and to survive long term in vivo. Studies 
have shown that cultured effector cells can be induced to grow in vivo and to survive 
long term in substantial numbers by repeated stimulation with antigen supplemented 

20 with IL-2 (see, for example, Cheever et al., Immunological Reviews 157:111, 1997). 

Alternatively, a vector expressing a polypeptide recited herein may be 
introduced into antigen presenting cells taken from a patient and clonally propagated ex 
vivo for transplant back into the same patient. Transfected cells may be reintroduced 
into the patient using any means known in the art, preferably in sterile form by 

25 intravenous, intracavitary, intraperitoneal or intratumor administration. 

Routes and frequency of administration of the therapeutic compositions 
described herein, as well as dosage, will vary from individual to individual, and may be 
readily established using standard techniques. In general, the pharmaceutical 
compositions and vaccines may be administered by injection (e.g., intracutaneous, 

30 intramuscular, intravenous or subcutaneous), intranasally (e.g., by aspiration) or orally. 
Preferably, between 1 and 10 doses may be administered over a 52 week period. 
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Preferably, 6 doses are administered, at intervals of 1 month, and booster vaccinations 
may be given periodically thereafter. Alternate protocols may be appropriate for 
individual patients. A suitable dose is an amount of a compound that, when 
administered as described above, is capable of promoting an anti-tumor immune 
5 response, and is at least 10-50% above the basal (i.e., untreated) level. Such response 
can be monitored by measuring the anti-tumor antibodies in a patient or by vaccine- 
dependent generation of cytolytic effector cells capable of killing the patient's tumor 
cells in vitro. Such vaccines should also be capable of causing an immune response that 
leads to an improved clinical outcome (e.g., more frequent remissions, complete or 

10 partial or longer disease-free survival) in vaccinated patients as compared to non- 
vaccinated patients. In general, for pharmaceutical compositions and vaccines 
comprising one or more polypeptides, the amount of each polypeptide present in a dose 
ranges from about 25 jug to 5 mg per kg of host. Suitable dose sizes will vary with the 
size of the patient, but will typically range from about 0.1 mL to about 5 mL. 

15. In general, an appropriate dosage and treatment regimen provides the 

active compound(s) in an amount sufficient to provide therapeutic and/or prophylactic 
benefit. Such a response can be monitored by establishing an improved clinical 
outcome (e.g., more frequent remissions, complete or partial, or longer disease-free 
survival) in treated patients as compared to non-treated patients. Increases in 

20 preexisting immune responses to a tumor protein generally correlate with an improved 
clinical outcome. Such immune responses may generally be evaluated using standard 
proliferation, cytotoxicity or cytokine assays, which may be performed using samples 
obtained from a patient before and after treatment. 



Cancer Detection and Diagnostic Compositions, Methods and Kits 
25 In general, a cancer may be detected in a patient based on the presence of 

one or more lung tumor proteins and/or polynucleotides encoding such proteins in a 
biological sample (for example, blood, sera, sputum urine and/or tumor biopsies) 
obtained from the patient. In other words, such proteins may be used as markers to 
indicate the presence or absence of a cancer such as lung cancer. In addition, such 
30 proteins may be useful for the detection of other cancers. The binding agents provided 
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herein generally permit detection of the level of antigen that binds to the agent in the 
biological sample. 

Polynucleotide primers and probes may be used to detect the level of 
mRNA encoding a tumor protein, which is also indicative of the presence or absence of 
5 a cancer. In general, a tumor sequence should be present at a level that is at least two- 
fold, preferably three-fold, and more preferably five-fold or higher in tumor tissue than 
in normal tissue of the same type from which the tumor arose. Expression levels of a 
particular tumor sequence in tissue types different from that in which the tumor arose 
are irrelevant in certain diagnostic embodiments since the presence of tumor cells can 

10 be confirmed by observation of predetermined differential expression levels, e.g., 2- 
fold, 5-fold, etc, in tumor tissue to expression levels in normal tissue of the same type. 

Other differential expression patterns can be utilized advantageously for 
diagnostic purposes. For example, in one aspect of the invention, overexpression of a 
tumor sequence in tumor tissue and normal tissue of the same type, but not in other 

15 normal tissue types, e.g. PBMCs, can be exploited diagnostically. In this case, the 
presence of metastatic tumor cells, for example in a sample taken from the circulation 
or some other tissue site different from that in which the tumor arose, can be identified 
and/or confirmed by detecting expression of the tumor sequence in the sample, for 
example using RT-PCR analysis. In many instances, it will be desired to enrich for 

20 tumor cells in the sample of interest, e.g., PBMCs, using cell capture or other like 
techniques. 

There are a variety of assay formats known to those of ordinary skill in 
the art for using a binding agent to detect polypeptide markers in a sample. See, e.g., 
Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 

25 1988. In general, the presence or absence of a cancer in a patient may be determined by 
(a) contacting a biological sample obtained from a patient with a binding agent; (b) 
detecting in the sample a level of polypeptide that binds to the binding agent; and (c) 
comparing the level of polypeptide with a predetermined cut-off value. 

In a preferred embodiment, the assay involves the use of binding agent 

30 immobilized on a solid support to bind to and remove the polypeptide from the 
remainder of the sample. The bound polypeptide may then be detected using a detection 

100 



WO 02/47534 



PCT7US01/47576 



reagent that contains a reporter group and specifically binds to the binding 
agent/polypeptide complex. Such detection reagents may comprise, for example, a 
binding agent that specifically binds to the polypeptide or an antibody or other agent 
that specifically binds to the binding agent, such as an anti-immunoglobulin, protein G, 
5 protein A or a lectin. Alternatively, a competitive assay may be utilized, in which a 
polypeptide is labeled with a reporter group and allowed to bind to the immobilized 
binding agent after incubation of the binding agent with the sample. The extent to 
which components of the sample inhibit the binding of the labeled polypeptide to the 
binding agent is indicative of the reactivity of the sample with the immobilized binding 
10 agent. Suitable polypeptides for use within such assays include full length lung tumor 
proteins and polypeptide portions thereof to which the binding agent binds, as described 
above. 

The solid support may be any material known to those of ordinary skill 
in the art to which the tumor protein may be attached. For example, the solid support 

15 may be a test well in a microtiter plate or a nitrocellulose or other suitable membrane. 
Alternatively, the support may be a bead or disc, such as glass, fiberglass, latex or a 
plastic material such as polystyrene or polyvinylchloride. The support may also be a 
magnetic particle or a fiber optic sensor, such as those disclosed, for example, in U.S. 
Patent No. 5,359,681. The binding agent may be immobilized on the solid support 

20 using a variety of techniques known to those of skill in the art, which are amply 
described in the patent and scientific literature. In the context of the present invention, 
the term "immobilization" refers to both noncovalent association, such as adsorption, 
and covalent attachment (which may be a direct linkage between the agent and 
functional groups on the support or may be a linkage by way of a cross-linking agent). 

25 Immobilization by adsorption to a well in a microtiter plate or to a membrane is 
preferred. In such cases, adsorption may be achieved by contacting the binding agent, in 
a suitable buffer, with the solid support for a suitable amount of time. The contact time 
varies with temperature, but is typically between about 1 hour and about 1 day. In 
general, contacting a well of a plastic microtiter plate (such as polystyrene or 

30 polyvinylchloride) with an amount of binding agent ranging from about 10 ng to about 
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10 ng, and preferably about 100 ng to about 1 ng, is sufficient to immobilize an 
adequate amount of binding agent. 

Covalent attachment of binding agent to a solid support may generally be 
achieved by first reacting the support with a bifunctional reagent that will react with 
5 both the support and a functional group, such as a hydroxyl or amino group, on the 
binding agent. For example, the binding agent may be covalently attached to supports 
having an appropriate polymer coating using benzoquinone or by condensation of an 
aldehyde group on the support with an amine and an active hydrogen on the binding 
partner (see, e.g., Pierce Immunotechnology Catalog and Handbook, 1991, at 
10 A12-A13). 

In certain embodiments, the assay is a two-antibody sandwich assay. 
This assay may be performed by first contacting an antibody that has been immobilized 
on a solid support, commonly the well of a microtiter plate, with the sample, such that 
polypeptides within the sample are allowed to bind to the immobilized antibody. 

15 Unbound sample is then removed from the immobilized polypeptide-antibody 
complexes and a detection reagent (preferably a second antibody capable of binding to a 
different site on the polypeptide) containing a reporter group is added. The amount of 
detection reagent that remains bound to the solid support is then detemiined using a 
method appropriate for the specific reporter group. 

20 More specifically, once the antibody is immobilized on the support as 

described above, the remaining protein binding sites on the support are typically 
blocked. Any suitable blocking agent known to those of ordinary skill in the art, such as 
bovine serum albumin or Tween 20™ (Sigma Chemical Co., St. Louis, MO). The 
immobilized antibody is then incubated with the sample, and polypeptide is allowed to 

25 bind to the antibody. The sample may be diluted with a suitable diluent, such as 
phosphate-buffered saline (PBS) prior to incubation. In general, an appropriate contact 
time (i.e., incubation time) is a period of time that is sufficient to detect the presence of 
polypeptide within a sample obtained from an individual with lung cancer. Preferably, 
the contact time is sufficient to achieve a level of binding that is at least about 95% of 

30 that achieved at equilibrium between bound and unbound polypeptide. Those of 
ordinary skill in the art will recognize that the time necessary to achieve equilibrium 
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may be readily determined by assaying the level of binding that occurs over a period of 
time. At room temperature, an incubation time of about 30 minutes is generally 
sufficient. 

Unbound sample may then be removed by washing the solid support 
5 with an appropriate buffer, such as PBS containing 0.1% Tween 20™. The second 
antibody, which contains a reporter group, may then be added to the solid support. 
Preferred reporter groups include those groups recited above. 

The detection reagent is then incubated with the immobilized antibody- 
polypeptide complex for an amount of time sufficient to detect the bound polypeptide. 

10 An appropriate amount of time may generally be determined by assaying the level of 
binding that occurs over a period of time. Unbound detection reagent is then removed 
and bound detection reagent is detected using the reporter group. The method employed 
for detecting the reporter group depends upon the nature of the reporter group. For 
radioactive groups, scintillation counting or autoradiographic methods are generally 

15 appropriate. Spectroscopic methods may be used to detect dyes, luminescent groups 
and fluorescent groups. Biotin may be detected using avidin, coupled to a different 
reporter group (commonly a radioactive or fluorescent group or an enzyme). Enzyme 
reporter groups may generally be detected by the addition of substrate (generally for a 
specific period of time), followed by spectroscopic or other analysis of the reaction 

20 products. 

To determine the presence or absence of a cancer, such as lung cancer, 
the signal detected from the reporter group that remains bound to the solid support is 
generally compared to a signal that corresponds to a predetermined cut-off value. In 
one preferred embodiment, the cut-off value for the detection of a cancer is the average 

25 mean signal obtained when the immobilized antibody is incubated with samples from 
patients without the cancer. In general, a sample generating a signal that is three 
standard deviations above the predetermined cut-off value is considered positive for the 
cancer. In an alternate preferred embodiment, the cut-off value is determined using a 
Receiver Operator Curve, according to the method of Sackett et al., Clinical 

30 Epidemiology: A Basic Science for Clinical Medicine, Little Brown and Co., 1985, 
p. 106-7. Briefly, in this embodiment, the cut-off value may be determined from a plot 
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of pairs of true positive rates (i.e., sensitivity) and false positive rates (100%-specificity) 
that correspond to each possible cut-off value for the diagnostic test result. The cut-off 
value on the plot that is the closest to the upper left-hand corner (r'.e., the value that 
encloses the largest area) is the most accurate cut-off value, and a sample generating a 
5 signal that is higher than the cut-off value determined by this method may be considered 
positive. Alternatively, the cut-off value may be shifted to the left along the plot, to 
minimize the false positive rate, or to the right, to minimize the false negative rate. In 
general, a sample generating a signal that is higher than the cut-off value determined by 
this method is considered positive for a cancer. 

10 In a related embodiment, the assay is performed in a flow-through or 

strip test format, wherein the binding agent is immobilized on a membrane, such as 
nitrocellulose. In the flow-through test, polypeptides within the sample bind to the 
immobilized binding agent as the sample passes through the membrane. A second, 
labeled binding agent then binds to the binding agent-polypeptide complex as a solution 

15 containing the second binding agent flows through the membrane. The detection of 
bound second binding agent may then be performed as described above. In the strip test 
format, one end of the membrane to which binding agent is bound is immersed in a 
solution containing the sample. The sample migrates along the membrane through a 
region containing second binding agent and to the area of immobilized binding agent. 

20 Concentration of second binding agent at the area of immobilized antibody indicates the 
presence of a cancer. Typically, the concentration of second binding agent at that site 
generates a pattern, such as a line, that can be read visually. The absence of such a 
pattern indicates a negative result. In general, the amount of binding agent immobilized 
on the membrane is selected to generate a visually discernible pattern when the 

25 biological sample contains a level of polypeptide that would be sufficient to generate a 
positive signal in the two-antibody sandwich assay, in the format discussed above. 
Preferred binding agents for use in such assays are antibodies and antigen-binding 
fragments thereof. Preferably, the amount of antibody immobilized on the membrane 
ranges from about 25 ng to about lug, and more preferably from about 50 ng to about 

30 500 ng. Such tests can typically be performed with a very small amount of biological 
sample. 
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Of course, numerous other assay protocols exist that are suitable for use 
with the tumor proteins or binding agents of the present invention. The above 
descriptions are intended to be exemplary only. For example, it will be apparent to 
those of ordinary skill in the art that the above protocols may be readily modified to use 
5 tumor polypeptides to detect antibodies that bind to such polypeptides in a biological 
sample. The detection of such tumor protein specific antibodies may correlate with the 
presence of a cancer. 

A cancer may also, or alternatively, be detected based on the presence of 
T cells that specifically react with a tumor protein in a biological sample. Within 

10 certain methods, a biological sample comprising CD4 + and/or CD8 + T cells isolated 
from a patient is incubated with a tumor polypeptide, a polynucleotide encoding such a 
polypeptide and/or an APC that expresses at least an immunogenic portion of such a 
polypeptide, and the presence or absence of specific activation of the T cells is detected. 
Suitable biological samples include, but are not limited to, isolated T cells. For 

15 example, T cells may be isolated from a patient by routine techniques (such as by 
Ficoll/Hypaque density gradient centrifugation of peripheral blood lymphocytes). T 
cells may be incubated in vitro for 2-9 days (typically 4 days) at 37°C with polypeptide 
{e.g., 5-25 ug/ml). It may be desirable to incubate another aliquot of a T cell sample in 
the absence of tumor polypeptide to serve as a control. For CD4 + T cells, activation is 

20 preferably detected by evaluating proliferation of the T cells. For CD8 + T cells, 
activation is preferably detected by evaluating cytolytic activity. A level of proliferation 
that is at least two fold greater and/or a level of cytolytic activity that is at least 20% 
greater than in disease-free patients indicates the presence of a cancer in the patient. 

As noted above, a cancer may also, or alternatively, be detected based on 

25 the level of mRNA encoding a tumor protein in a biological sample. For example, at 
least two oligonucleotide primers may be employed in a polymerase chain reaction 
(PGR) based assay to amplify a portion of a tumor cDNA derived from a biological 
sample, wherein at least one of the oligonucleotide primers is specific for (i.e., 
hybridizes to) a polynucleotide encoding the tumor protein. The amplified cDNA is 

30 then separated and detected using techniques well known in the art, such as gel 
electrophoresis. 
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Similarly, oligonucleotide probes that specifically hybridize to a 
polynucleotide encoding a tumor protein may be used in a hybridization assay to detect 
the presence of polynucleotide encoding the tumor protein in a biological sample. 

To permit hybridization under assay conditions, oligonucleotide primers 
5 and probes should comprise an oligonucleotide sequence that has at least about 60%, 
preferably at least about 75% and more preferably at least about 90%, identity to a 
portion of a polynucleotide encoding a tumor protein of the invention that is at least 10 
nucleotides, and preferably at least 20 nucleotides, in length. Preferably, 
oligonucleotide primers and/or probes hybridize to a polynucleotide encoding a 

10 polypeptide described herein under moderately stringent conditions, as defined above. 
Oligonucleotide primers and/or probes which may be usefully employed in the 
diagnostic methods described herein preferably are at least 10-40 nucleotides in length. 
In a preferred embodiment, the oligonucleotide primers comprise at least 10 contiguous 
nucleotides, more preferably at least 15 contiguous nucleotides, of a DNA molecule 

15 having a sequence as disclosed herein. Techniques for both PCR based assays and 
hybridization assays are well known in the art (see, for example, Mullis et al., Cold 
Spring Harbor Symp. Quant. Biol, 51:263, 1987; Erlich ed., PCR Technology, Stockton 
Press, NY, 1989). 

One preferred assay employs RT-PCR, in which PCR is applied in 
20 conjunction with reverse transcription. Typically, RNA is extracted from a biological 
sample, such as biopsy tissue, and is reverse transcribed to produce cDNA molecules. 
PCR amplification using at least one specific primer generates a cDNA molecule, which 
may be separated and visualized using, for example, gel electrophoresis. Amplification 
may be performed on biological samples taken from a test patient and from an 
25 individual who is not afflicted with a cancer. The amplification reaction may be 
performed on several dilutions of cDNA spanning two orders of magnitude. A two-fold 
or greater increase in expression in several dilutions of the test patient sample as 
compared to the same dilutions of the non-cancerous sample is typically considered 
positive. 

30 In another aspect of the present invention, cell capture technologies may 

be used in conjunction, with, for example, real-time PCR to provide a more sensitive 
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tool for detection of metastatic cells expressing lung tumor antigens. Detection of lung 
cancer cells in biological samples, e.g., bone marrow samples, peripheral blood, and 
small needle aspiration samples is desirable for diagnosis and prognosis in lung cancer 
patients. 

5 Immunomagnetic beads coated with specific monoclonal antibodies to 

surface cell markers, or tetrameric antibody complexes, may be used to first enrich or 
positively select cancer cells in a sample. Various commercially available kits may be 
used, including Dynabeads® Epithelial Enrich (Dynal Biotech, Oslo, Norway), 
StemSep™ (StemCell Technologies, Inc., Vancouver, BC), and RosetteSep (StemCell 

10 Technologies). A skilled artisan will recognize that other methodologies and kits may 
also be used to enrich or positively select desired cell populations. Dynabeads® 
Epithelial Enrich contains magnetic beads coated with mAbs specific for two 
glycoprotein membrane antigens expressed on normal and neoplastic epithelial tissues. 
The coated beads may be added to a sample and the sample then applied to a magnet, 

15 thereby capturing the cells bound to the beads. The unwanted cells are washed away 
and the magnetically isolated cells eluted from the beads and used in further analyses. 

RosetteSep can be used to enrich cells directly from a blood sample and 
consists of a cocktail of tetrameric antibodies that targets a variety of unwanted cells 
and crosslinks them to glycophorin A on red blood cells (RBC) present in the sample, 

20 forming rosettes. When centrifuged over Ficoll, targeted cells pellet along with the free 
RBC. The combination of antibodies in the depletion cocktail determines which cells 
will be removed and consequently which cells will be recovered. Antibodies that are 
available include, but are not limited to: CD2, CD3, CD4, CD5, CD8, CD10, CDllb, 
CD14, CD15, CD16, CD19, CD20, CD24, CD25, CD29, CD33, CD34, CD36, CD38, 

25 CD41 , CD45, CD45RA, CD45RO, CD56, CD66B, CD66e, HLA-DR, IgE, and TCRap. 

Additionally, it is contemplated in the present invention that mAbs 
specific for lung tumor antigens can be generated and used in a similar manner. For 
example, mAbs that bind to tumor-specific cell surface antigens may be conjugated to 
magnetic beads, or formulated in a tetrameric antibody complex, and used to enrich or 

30 positively select metastatic lung tumor cells from a sample. Once a sample is enriched 
or positively selected, cells may be lysed and RNA isolated. RNA may then be 
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subjected to RT-PCR analysis using lung tumor-specific primers in a real-time PCR 
assay as described herein. One skilled in the art will recognize that enriched or selected 
populations of cells may be analyzed by other methods (e.g. in situ hybridization or 
flow cytometry). 

5 In another embodiment, the compositions described herein may be used 

as markers for the progression of cancer. In this embodiment, assays as described above 
for the diagnosis of a cancer may be performed over time, and the change in the level of 
reactive polypeptide(s) or polynucleotide(s) evaluated. For example, the assays may be 
performed every 24-72 hours for a period of 6 months to 1 year, and thereafter 

1 0 performed as needed. In general, a cancer is progressing in those patients in whom the 
level of polypeptide or polynucleotide detected increases over time. In contrast, the 
cancer is not progressing when the level of reactive polypeptide or polynucleotide either 
remains constant or decreases with time. 

Certain in vivo diagnostic assays may be performed directly on a tumor. 

15 One such assay involves contacting tumor cells with a binding agent. The bound 
binding agent may then be detected directly or indirectly via a reporter group. Such 
binding agents may also be used in histological applications. Alternatively, 
polynucleotide probes may be used within such applications. 

As noted above, to improve sensitivity, multiple tumor protein markers 

20 may be assayed within a given sample. It will be apparent that binding agents specific 
for different proteins provided herein may be combined within a single assay. Further, 
multiple primers or probes may be used concurrently. The selection of tumor protein 
markers may be based on routine experiments to determine combinations that results in 
optimal sensitivity. In addition, or alternatively, assays for tumor proteins provided 

25 herein may be combined with assays for other known tumor antigens. 

The present invention, further provides kits for use within any of the 
above diagnostic methods. Such kits typically comprise two or more components 
necessary for performing a diagnostic assay. Components may be compounds, reagents, 
containers and/or equipment. For example, one container within a kit may contain a 

30 monoclonal antibody or fragment thereof that specifically binds to a tumor protein. 
Such antibodies or fragments may be provided attached to a support material, as 
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described above. One or more additional containers may enclose elements, such as 
reagents or buffers, to be used in the assay. Such kits may also, or alternatively, contain 
a detection reagent as described above that contains a reporter group suitable for direct 
or indirect detection of antibody binding. 
5 Alternatively, a kit may be designed to detect the level of mRNA 

encoding a tumor protein in a biological sample. Such kits generally comprise at least 
one oligonucleotide probe or primer, as described above, that hybridizes to a 
polynucleotide encoding a tumor protein. Such an oligonucleotide may be used, for 
example, within a PCR or hybridization assay. Additional components that may be 
10 present within such kits include a second oligonucleotide and/or a diagnostic reagent or 
container to facilitate the detection of a polynucleotide encoding a tumor protein. 

The following examples are offered by way of illustration and not by 
way of limitation. 

EXAMPLES 



15 EXAMPLE 1 

Isolation and Characterization of cDNA Sequences Encoding 
Lung Tumor Polypeptides 



This example illustrates the isolation of cDNA molecules encoding lung 
tumor-specific polypeptides from lung tumor cDNA libraries. 

20 A. ISOLATION OF CDNA SEQUENCES FROM A LUNG SQUAMOUS CELL 
CARCINOMA LIBRARY 

A human lung squamous cell carcinoma cDNA expression library was 
constructed from poly A + RNA from a pool of two patient tissues using a Superscript 
Plasmid System for cDNA Synthesis and Plasmid Cloning kit (BRL Life Technologies, 
25 Gaithersburg, MD) following the manufacturer's protocol. Specifically, lung carcinoma 
tissues were homogenized with polytron (Kinematica, Switzerland) and total RNA was 
extracted using Trizol reagent (BRL Life Technologies) as directed by the manufacturer. 
The poly A + RNA was then purified using an oligo dT cellulose column as described in 
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Sambrook etal, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratories, Cold Spring Harbor, NY, 1989. First-strand cDNA was synthesized using 
the NotI/Olig0-dT18 primer. Double- stranded cDNA was synthesized, ligated with 
BstXI/EcoRI adaptors (Invitrogen, San Diego, CA) and digested with Notl. Following 
5 size fractionation with cDNA size fractionation columns (BRL Life Technologies), the 
cDNA was ligated into the BstXI/NotI site of pcDNA3.1 (Invitrogen) and transformed 
into ElectroMax E. coli DH10B cells (BRL Life Technologies) by electroporation. 

Using the same procedure, a normal human lung cDNA expression 
library was prepared from a pool of four tissue specimens. The cDNA libraries were 

10 characterized by determining the number of independent colonies, the percentage of 
clones that carried insert, the average insert size and by sequence analysis. The lung 
squamous cell carcinoma library contained 2.7 x 10 6 independent colonies, with 100% 
of clones having an insert and the average insert size being 2100 base pairs. The normal 
lung cDNA library contained 1.4 x 10 6 independent colonies, with 90% of clones 

15 having inserts and the average insert size being 1800 base pairs. For both libraries, 
sequence analysis showed that the majority of clones had a full length cDNA sequence 
and were synthesized from mRNA 

cDNA library subtraction was performed using the above lung squamous 
cell carcinoma and normal lung cDNA libraries, as described by Hara et ah {Blood, 

20 54:189-199, 1994) with some modifications. Specifically, a lung squamous cell 
carcinoma-specific subtracted cDNA library was generated as follows. Normal tissue 
cDNA library (80 ug) was digested with BamHI and Xhol, followed by a filling-in 
reaction with DNA polymerase Klenow fragment. After phenol-chloroform extraction 
and ethanol precipitation, the DNA was dissolved in 133 ul of H2O, heat-denatured and 

25 mixed with 133 ul (133 jug) of Photoprobe biotin (Vector Laboratories, Burlingame, 
CA). As recommended by the manufacturer, the resulting mixture was irradiated with a 
270 W sunlamp on ice for 20 minutes. Additional Photoprobe biotin (67 ul) was added 
and the biotinylation reaction was repeated. After extraction with butanol five times, 
the DNA was ethanol-precipitated and dissolved in 23 ul H 2 0 to form the driver DNA. 

30 To form the tracer DNA, 10 jag lung squamous cell carcinoma cDNA 

library was digested with Notl and Spel, phenol chloroform extracted and passed 
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through Chroma spin-400 columns (Clontech, Palo Alto, CA). Typically, 5 ug of 
cDNA was recovered after the sizing column. Following ethanol precipitation, the 
tracer DNA was dissolved in 5 ul H2O. Tracer DNA was mixed with 15 ul driver DNA 
and 20 ul of 2 x hybridization buffer (1.5 M NaCl/10 mM EDTA/50 mM HEPES pH 
5 7.5/0.2% sodium dodecyl sulfate), overlaid with mineral oil, and heat-denatured 
completely. The sample was immediately transferred into a 68 °C water bath and 
incubated for 20 hours (long hybridization [LH]). The reaction mixture was then 
subjected to a streptavidin treatment followed by phenol/chloroform extraction. This 
process was repeated three more times. Subtracted DNA was precipitated, dissolved in 

10 12 ul H 2 0, mixed with 8 pi driver DNA and 20 pi of 2 x hybridization buffer, and 
subjected to a hybridization at 68 °C for 2 hours (short hybridization [SH]). After 
removal of biotinylated double-stranded DNA, subtracted cDNA was ligated into 
Notl/Spel site of chloramphenicol resistant pBCSK + (Stratagene, La Jolla, CA) and 
transformed into ElectroMax E. coli DH10B cells by electroporation to generate a lung 

15 squamous cell carcinoma specific subtracted cDNA library (herein after referred to as 
"lung subtraction I"). 

A second lung squamous cell carcinoma specific subtracted cDNA 
library (referred to as "lung subtraction II") was generated in a similar way to the lung 
subtraction library I, except that eight frequently recovered genes from lung subtraction 

20 I were included in the driver DNA, and 24,000 independent clones were recovered. 

To analyze the subtracted cDNA libraries, plasmid DNA was prepared 
from 320 independent clones, randomly picked from the subtracted lung squamous cell 
carcinoma specific libraries. Representative cDNA clones were further characterized by 
DNA sequencing with a Perkin Elmer/Applied Biosystems Division Automated 

25 Sequencer Model 373A and/or Model 377 (Foster City, CA). The cDNA sequences for 
sixty isolated clones are provided in SEQ ID NO: 1-60. These sequences were 
compared to known sequences in the gene bank using the EMBL and GenBank 
databases (release 96). No significant homologies were found to the sequences 
provided in SEQ ID NO: 2, 3, 19, 38 and 46. The sequences of SEQ ID NO: 1, 6-8, 10- 

30 13, 15, 17, 18, 20-27, 29, 30, 32, 34-37, 39-45, 47-49, 51, 52, 54, 55 and 57-59 were 
found to show some homology to previously identified expressed sequence tags (ESTs). 
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The sequences of SEQ ID NO: 9, 28, 31 and 33 were found to show some homology to 
previously identified non-human gene sequences and the sequences of SEQ ID NO: 4, 
5, 14, 50, 53, 56 and 60 were found to show some homology to gene sequences 
previously identified in humans. 
5 The subtraction procedure described above was repeated using the above 

lung squamous cell carcinoma cDNA library as the tracer DNA, and the above normal 
lung tissue cDNA library and a cDNA library from normal liver and heart (constructed 
from a pool of one sample of each tissue as described above), plus twenty other cDNA 
clones that were frequently recovered in lung subtractions I and II, as the driver DNA 

10 (lung subtraction III). The normal liver and heart cDNA library contained 1.76 x 10 6 
independent colonies, with 100% of clones having inserts and the average insert size 
being 1600 base pairs. Ten additional clones were isolated (SEQ ID NO: 61-70). 
Comparison of these cDNA sequences with those in the gene bank as described above, 
revealed no significant homologies to the sequences provided in SEQ ID NO: 62 and 

15 67. The sequences of SEQ ID NO: 61, 63-66, 68 and 69 were found to show some 
homology to previously isolated ESTs and the sequence provided in SEQ ID NO: 70 
was found to show some homology to a previously identified rat gene. 

In further studies, the subtraction procedure described above was 
repeated using the above lung squamous cell carcinoma cDNA library as the tracer 

20 DNA, and a cDNA library from a pool of normal lung, kidney, colon, pancreas, brain, 
resting PBMC, heart, skin and esophagus as the driver DNA, with esophagus cDNAs 
making up one third of the driver material. Since esophagus is enriched in normal 
epithelial cells, including differentiated squamous cells, this procedure is likely to 
enrich genes that are tumor specific rather than tissues specific. The cDNA sequences 

25 of 48 clones determined in this subtraction are provided in SEQ ID NO: 177-224. The 
sequences of SEQ ID NO: 177, 178, 180, 181, 183, 187, 192, 195-197, 208, 211, 212, 
215, 216, 218 and 219 showed some homology to previously identified genes. The 
sequences of SEQ ID NO: 179, 182, 184-186, 188-191, 193, 194, 198-207, 209 210, 
213, 214, 217, 220 and 224 showed some homology to previously determined ESTs. 

30 The sequence of SEQ ID NO: 221-223 showed no homology to any previously 
determined sequence. 



112 



WO 02/47534 



PCT7US01/47576 



B. ISOLATION OF cDNA SEQUENCES FROM A LUNG 
ADENOCARCINOMA LIBRARY 

A human lung adenocarcinoma cDNA expression library was 
constructed as described above. The library contained 3.2 x 10 6 independent colonies, 
5 with 100% of clones having an insert and the average insert size being 1500 base pairs. 
Library subtraction was performed as described above using the normal lung and 
normal liver and heart cDNA expression libraries described above as the driver DNA. 
Twenty-six hundred independent clones were recovered. 

Initial cDNA sequence analysis from 100 independent clones revealed 

10 many ribosomal protein genes. The cDNA sequences for fifteen clones isolated in this 
subtraction are provided in SEQ ID NO: 71-86. Comparison of these sequences with 
those in the gene bank as described above revealed no significant homologies to the 
sequence provided in SEQ ID NO: 84. The sequences of SEQ ID NO: 71, 73, 74, 77, 
78 and 80-82 were found to show some homology to previously isolated ESTs, and the 

15 sequences of SEQ ID NO: 72, 75, 76, 79, 83 and 85 were found to show some 
homology to previously identified human genes. 

In further studies, a cDNA library (referred to as mets3616A) was 
constructed from a metastatic lung adenocarcinoma. The determined cDNA sequences 
of 25 clones sequenced at random from this library are provided in SEQ ID NO: 255- 

20 279. The mets3616A cDNA library was subtracted against a cDNA library prepared 
from a pool of normal lung, liver, pancreas, skin, kidney, brain and resting PBMC. To 
increase the specificity of the subtraction, the driver was spiked with genes that were 
determined to be most abundant in the mets3616A cDNA library, such as EF1 -alpha, 
integrin-beta and anticoagulant protein PP4, as well as with cDNAs that were 

25 previously found to be differentially expressed in subtracted lung adenocarcinoma 
cDNA libraries. The determined cDNA sequences of 51 clones isolated from the 
subtracted library (referred to as mets3616A-Sl) are provided in SEQ ID NO: 280-330. 

Comparison of the sequences of SEQ ID NO: 255-330 with those in the 
public databases revealed no significant homologies to the sequences of SEQ ID NO: 

30 255-258, 260, 262-264, 270, 272, 275, 276, 279, 281, 287, 291, 296, 300 and 310. The 
sequences of SEQ ID NO: 259, 261, 265-269, 271, 273, 274, 277, 278, 282-285, 288- 
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290, 292, 294, 297-299, 301, 303-309, 313, 314, 316, 320-324 and 326-330 showed 
some homology to previously identified gene sequences, while the sequences of SEQ ID 
NO: 280, 286, 293, 302, 310, 312, 315, 317-319 and 325 showed some homology to 
previously isolated expressed sequence tags (ESTs). 

5 EXAMPLE 2 

Determination of Tissue Specificity of Lung Tumor Polypeptides 

Using gene specific primers, mRNA expression levels for seven 
representative lung tumor polypeptides described in Example 1 were examined in a 
variety of normal and tumor tissues using RT-PCR. 

10 Briefly, total RNA was extracted from a variety of normal and tumor 

tissues using Trizol reagent as described above. First strand synthesis was carried out 
using 2 jug of total RNA with Superscript II reverse transcriptase (BRL Life 
Technologies) at 42 °C for one hour. The cDNA was then amplified by PCR with gene- 
specific primers. To ensure the semi-quantitative nature of the RT-PCR, P-actin was 

15 used as an internal control for each of the tissues examined. 1 jal of 1:30 dilution of 
cDNA was employed to enable the linear range amplification of the p-actin template 
and was sensitive enough to reflect the differences in the initial copy numbers. Using 
these conditions, the P-actin levels were determined for each reverse transcription 
reaction from each tissue. DNA contamination was minimized by DNase treatment and 

20 by assuring a negative PCR result when using first strand cDNA that was prepared 
without adding reverse transcriptase. 

mRNA Expression levels were examined in five different types of tumor 
tissue (lung squamous cell carcinoma from 3 patients, lung adenocarcinoma, colon 
tumor from 2 patients, breast tumor and prostate tumor), and thirteen different normal 

25 tissues (lung from 4 donors, prostate, brain, kidney, liver, ovary, skeletal muscle, skin, 
small intestine, stomach, myocardium, retina and testes). Using a 10-fold amount of 
cDNA, the antigen LST-S1-90 (SEQ ID NO: 3) was found to be expressed at high 
levels in lung squamous cell carcinoma and in breast tumor, and at low to undetectable 
levels in the other tissues examined. 
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The antigen LST-S2-68 (SEQ ID NO: 15) appears to be specific to lung 
and breast tumor, however, expression was also detected in normal kidney. Antigens 
LST-S1-169 (SEQ ID NO: 6) and LST-S1-133 (SEQ ID NO: 5) appear to be very 
abundant in lung tissues (both normal and tumor), with the expression of these two 
5 genes being decreased in most of the normal tissues tested. Both LST-S 1-169 and LST- 
S 1-133 were also expressed in breast and colon tumors. Antigens LST-S 1-6 (SEQ ID 
NO: 7) and LST-S2-I2-5F (SEQ ID NO: 47) did not show tumor or tissue specific 
expression, with the expression of LST-S 1-28 being rare and only detectable in a few 
tissues. The antigen LST-S3-7 (SEQ ID NO: 63) showed lung and breast tumor specific 

10 expression, with its message only being detected in normal testes when the PGR was 
performed for 30 cycles. Lower level expression was detected in some normal tissues 
when the cycle number was increased to 35. Antigen LST-S3-13 (SEQ ID NO: 66) was 
found to be expressed in 3 out of 4 lung tumors, one breast tumor and both colon tumor 
samples. Its expression in normal tissues was lower compared to tumors, and was only 

15 detected in 1 out of 4 normal lung tissues and in normal tissues from kidney, ovary and 
retina. Expression of antigens LST-S3-4 (SEQ ID NO: 62) and LST-S3-14 (SEQ ID 
NO: 67) was rare and did not show any tissue or tumor specificity. Consistent with 
Northern blot analyses, the RT-PCR results on antigen LAT-S1-A-10A (SEQ ID NO: 
78) suggested that its expression is high in lung, colon, stomach and small intestine 

20 tissues, including lung and colon tumors, whereas its expression was low or 
undetectable in other tissues. 

A total of 2002 cDNA fragments isolated in lung subtractions I, II and 
III, described above, were colony PCR amplified and their mRNA expression levels in 
lung tumor, normal lung, and various other normal and tumor tissues were determined 

25 using microarray technology (Synteni, Palo Alto, CA). Briefly, the PCR amplification 
products were dotted onto slides in an array format, with each product occupying a 
unique location in the array. mRNA was extracted from the tissue sample to be tested, 
reverse transcribed, and fluorescent-labeled cDNA probes were generated. The 
microarrays were probed with the labeled cDNA probes, the slides scanned and 

30 fluorescence intensity was measured. This intensity correlates with the hybridization 
intensity. Seventeen non-redundant cDNA clones showed over-expression in lung 
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squamous tumors, with expression in normal tissues tested (lung, skin, lymph node, 
colon, liver, pancreas, breast, heart, bone marrow, large intestine, kidney, stomach, 
brain, small intestine, bladder and salivary gland) being either undetectable, or 10-fold 
less compared to lung squamous tumors. The determined cDNA sequences for the 
5 clone L513S are provided in SEQ ID NO: 87 and 88; those for L514S are provided in 
SEQ ID NO: 89 and 90; those for L516S in SEQ ID NO: 91 and 92; that for L517S in 
SEQ ID NO: 93; that for L5 19S in SEQ ID NO: 94; those for L520S in SEQ ID NO: 95 
and 96; those for L521S in SEQ ID NO: 97 and 98; that for L522S in SEQ ID NO: 99; 
that for L523S in SEQ ID NO: 100; that for L524S in SEQ ID NO: 101; that for L525S 

10 in SEQ ID NO: 102; that for L526S in SEQ ID NO: 103; that for L527S in SEQ ID NO: 
104; that for L528S in SEQ ID NO: 105; that for L529S in SEQ ID NO: 106; and those 
for L530S in SEQ ID NO: 107 and 108. Additionally, the full-length cDNA sequence 
for L530S is provided in SEQ ID NO: 151, with the corresponding amino acid sequence 
being provided in SEQ ID NO: 152. L530S shows homology to a splice variant of a 

15 p53 tumor suppressor homologue, p63. The cDNA sequences of 7 known isoforms of 
p63 are provided in SEQ ID NO: 331-337, with the corresponding amino acid 
sequences being provided in SEQ ID NO: 338-344, respectively. 

Due to polymorphisms, the clone L531S appears to have two forms. A 
first determined full-length cDNA sequence for L531S is provided in SEQ ID NO: 109, 

20 with the corresponding amino acid sequence being provided in SEQ ID NO: 110. A 
second determined full-length cDNA sequence for L531S is provided in SEQ ID NO: 
111, with the corresponding amino acid sequence being provided in SEQ ID NO: 112. 
The sequence of SEQ ID NO: 1 1 1 is identical to that of SEQ ID NO: 109, except that it 
contains a 27 bp insertion. Similarly, L514S has two alternatively spliced forms; the 

25 first variant cDNA is listed as SEQ ID NO: 153, with the corresponding amino acid 
sequence being provided in SEQ ID NO: 155. The full-length cDNA for the second 
variant form of L514S is provided in SEQ ID NO: 154, with the corresponding amino 
acid sequence being provided in SEQ ID NO: 156. 

Full length cloning for L524S (SEQ ID NO: 101) yielded two variants 

30 (SEQ ID NO: 163 and 164) with the corresponding amino acid sequences of SEQ ID 
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NO: 165 and 166, respectively. Both variants have been shown to encode parathyroid 
hormone-related peptide. 

Attempts to isolate the full-length cDNA for L519S, resulted in the 
isolation of the extended cDNA sequence provided in SEQ ID NO: 173, which contains 
5 a potential open reading frame. The amino acid sequence encoded by the sequence of 
SEQ ID NO: 173 is provided in SEQ ID NO: 174. Additionally, the full-length cDNA 
sequence for the clone of SEQ ID NO: 100 (known as L523S), a known gene, is 
provided in SEQ ID NO: 175, with the corresponding amino acid sequence being 
provided in SEQ ID NO: 176. In further studies, a full-length cDNA sequence for 

10 L523S was isolated from a L523S-positive tumor cDNA library by PCR amplification 
using gene specific primers designed from the sequence of SEQ ID NO: 175. The 
determined full-length cDNA sequence is provided in SEQ ID NO: 347. The amino 
acid sequence encoded by this sequence is provided in SEQ ID NO: 348. This protein 
sequence differs from the previously published protein sequence at two amino acid 

1 5 positions, namely at positions 1 5 8 and 410. 

Comparison of the sequences of L514S and L531S (SEQ ID NO: 87 and 
88, and 109, respectively) with those in the gene bank, as described above, revealed no 
significant homologies to known sequences. The sequences of L513S, L516S, L517S, 
L519S, L520S and L530S (SEQ ID NO: 87 and 88, 91 and 92, 93, 94, 95 and 96, 107 

20 and 108, respectively) were found to show some homology to previously identified 
ESTs. The sequences of L521S, L522S, L523S, L524S, L525S, L526S, L527S, L528S 
and L529S (SEQ ID NO: 97 and 98, 99, 99, 101, 102, 103, 104, 105, and 106, 
respectively) were found to represent known genes. The determined full-length cDNA 
sequence for L520S is provided in SEQ ID NO: 113, with the corresponding amino 

25 acid sequence being provided in SEQ ID NO: 114. Subsequent microarray analysis 
showed L520S to be overexpressed in breast tumors in addition to lung squamous 
tumors. 

Further analysis demonstrated that L529S (SEQ ID NO: 106 and 115), 
L525S (SEQ ID NO: 102 and 120) and L527S (SEQ ID NO: 104) are cytoskeletal 
30 components and potentially squamous cell specific proteins. L529S is connexin 26, a 
gap junction protein. It was found to be highly expressed in one lung squamous tumor, 
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referred to as 9688T, and moderately over-expressed in two others. However, lower 
level expression of connexin 26 is also detectable in normal skin, colon, liver and 
stomach. The over-expression of connexin 26 in some breast tumors has been reported 
and a mutated form of L529S may result in over-expression in lung tumors. L525S is 
5 plakophilin 1, a desmosomal protein found in plaque-bearing adhering junctions of the 
skin. Expression levels for L525S mRNA was highly elevated in three out of four lung 
squamous tumors tested, and in normal skin. L527S has been identified as keratin 6 
isoform, type II 58 Kd keratin and cytokeratin 13, and shows over-expression in 
squamous tumors and low expression in normal skin, breast and colon tissues. Keratin 
10 and keratin-related genes have been extensively documented as potential markers for 
lung cancer including CYFRA2.1 (Pastor, A., et al, Eur. Respir. J., 10:603-609, 1997). 
L513S (SEQ ID NO: 87 and 88) shows moderate over-expression in several tumor 
tissues tested, and encodes a protein that was first isolated as a pemphigus vulgaris 
antigen. 

1 5 L520S (SEQ ID NO: 95 and 96) and L52 1 S (SEQ ID NO: 97 and 98) are 

highly expressed in lung squamous tumors, with L520S being up-regulated in normal 
salivary gland and L521S being over-expressed in normal skin. Both belong to a family 
of small proline rich proteins and represent markers for fully differentiated squamous 
cells. L521S has been described as a specific marker for lung squamous tumor (Hu, R., 

20 et al, Lung Cancer, 20:25-30, 1998). L515S (SEQ ED NO: 162) encodes IGF-P2 and 
L516S is an aldose reductase homologue. Both are moderately expressed in lung 
squamous tumors and in normal colon. Notably, L516S (SEQ ID NO: 91 and 92) is up- 
regulated in metastatic tumors but not primary lung adenocarcinoma, an indication of its 
potential role in metatasis and a potential prognostic marker. L522S (SEQ ID NO: 99) 

25 is moderately over-expressed in lung squamous tumors with minimum expression in 
normal tissues. L522S has been shown to belong to a class IV alcohol dehydrogenase, 
ADH7, and its expression profile suggests it is a squamous cell specific antigen. L523S 
(SEQ ID NO: 100) is moderately over-expressed in lung squamous tumor, human 
pancreatic cancer cell lines and pancreatic cancer tissues, suggesting this gene may be a 

30 shared antigen between pancreatic and lung squamous cell cancer. 
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L524S (SEQ ID NO: 101) is over-expressed in the majority of squamous 
tumors tested and is homologous with parathyroid hormone-related peptide (PTHrP), 
which is best known to cause humoral hypercalcaemia associated with malignant 
tumors such as leukemia, prostate and breast cancer. It is also believed that PTHrP is 
5 most commonly associated with squamous carcinoma of lung and rarely with lung 
adenocarcinoma (Davidson, L.A., et al, J. Pathol, 178: 398-401, 1996). L528S (SEQ 
ID NO: 105) is highly over-expressed in two lung squamous tumors with moderate 
expression in two other squamous tumors, one lung adenocarcinoma and some normal 
tissues, including skin, lymph nodes, heart, stomach and lung. It encodes the NMB 

10 gene that is similar to the precursor of melanocyte specific gene Pmell7, which is 
reported to be preferentially expressed in low-metastatic potential melanoma cell lines. 
This suggests that L528S may be a shared antigen in both melanoma and lung squamous 
cell carcinoma. L526S (SEQ ID NO: 103) was overexpressed in all lung squamous cell 
tumor tissues tested and has been shown to share homology with a gene (ATM) in 

15 which a mutation causes ataxia telangiectasia, a genetic disorder in humans causing a 
predisposition to cancer, among other symptoms. ATM encodes a protein that activates 
a p53 mediated cell-cycle checkpoint through direct binding and phosphorylation of the 
p53 molecule. Approximately 40% of lung cancers are associated with p53 mutations, 
and it is speculated that over-expression of ATM is a result of compensation for loss of 

20 p53 function, but it is unknown whether over-expression is the cause of result of lung 
squamous cell carcinoma. Additionally, expression of L526S (ATM) is also detected in 
a metastatic but not lung adenocarcinoma, suggesting a role in metastasis. 

Expression of L523S (SEQ ID NO: 175), was examined by real time 
RT-PCR as described above. In a first study using a panel of lung squamous tumors, 

25 L523S was found to be expressed in 4/7 lung squamous tumors, 2/3 head and neck 
squamous tumors and 2/2 lung adenocarcinomas, with low level expression being 
observed in skeletal muscle, soft palate and tonsil. In a second study using a lung 
adenocarcinoma panel, expression of L523S was observed in 4/9 primary 
adenocarcinomas, 2/2 lung pleural effusions, 1/1 metastatic lung adenocarcinomas and 

30 2/2 lung squamous tumors, with little expression being observed in normal tissues. 
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Expression of L523S in lung tumors and various normal tissues was also 
examined by Northern blot analysis, using standard techniques. In a first study, L523S 
was found to be expressed in a number of lung adenocarcinomas and squamous cell 
carcinomas, as well as normal tonsil. No expression was observed in normal lung. In a 
5 second study using a normal tissue blot (referred to as HB-12) from Clontech, no 
expression was observed in brain, skeletal muscle, colon, thymus, spleen, kidney, liver, 
small intestine, lung or PBMC, although there was strong expression in placenta. 

EXAMPLE 3 

Isolation and Characterization of Lung Tumor Polypeptides 
10 by PCR-Based Subtraction 

Eight hundred and fifty seven clones from a cDNA subtraction library, 
containing cDNA from a pool of two human lung squamous tumors subtracted against 
eight normal human tissue cDNAs including lung, PBMC, brain, heart, kidney, liver, 
pancreas, and skin, (Clontech, Palo Alto, CA) were derived and submitted to a first 

15 round of PCR amplification. This library was subjected to a second round of PCR 
amplification, following the manufacturer's protocol. The resulting cDNA fragments 
were subcloned into the P7-Adv vector (Clontech, Palo Alto, CA) and transformed into 
DH5a E. coli (Gibco, BRL). DNA was isolated from independent clones and 
sequenced using a Perkin Elmer/Applied Biosystems Division Automated Sequencer 

20 Model 373A. 

One hundred and sixty two positive clones were sequenced. Comparison 
of the DNA sequences of these clones with those in the EMBL and GenBank databases, 
as described above, revealed no significant homologies to 13 of these clones, hereinafter 
referred to as Contigs 13, 16, 17, 19, 22, 24, 29, 47, 49, 56-59. The determined cDNA 
25 sequences for these clones are provided in SEQ ID NO: 125, 127-129, 131-133, 142, 
144, 148-150, and 157, respectively. Contigs 1, 3-5, 7-10, 12, 11, 15, 20, 31, 33, 38, 39, 
41, 43, 44, 45, 48, 50, 53, 54 (SEQ ID NO: 115-124, 126, 130, 134-141, 143, 145-147, 
respectively) were found to show some degree of homology to previously identified 
DNA sequences. Contig 57 (SEQ ID NO: 149) was found to represent the clone L519S 
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(SEQ ID NO: 94) disclosed in US. Patent Application No. 09/123,912, filed July 27, 
1998. To the best of the inventors' knowledge, none of these sequences have been 
previously shown to be differentially over-expressed in lung tumors. 

mRNA expression levels for representative clones in lung tumor tissues, 
5 normal lung tissues (n=4), resting PBMC, salivary gland, heart, stomach, lymph nodes, 
skeletal muscle, soft palate, small intestine, large intestine, bronchial, bladder, tonsil, 
kidney, esophagus, bone marrow, colon, adrenal gland, pancreas, and skin (all derived 
from human) were determined by RT-PCR as described above. Expression levels using 
microarray technology, as described above, were examined in one sample of each tissue 

10 type unless otherwise indicated. 

Contig 3 (SEQ ID NO: 116) was found to be highly expressed in all head 
and neck squamous cell tumors tested (17/17), and expressed in the majority (8/12) of 
lung squamous tumors, (high expression in 7/12, moderate in 2/12, and low in 2/12), 
while showing negative expression for 2/4 normal lung tissues and low expression in 

15 the remaining two samples. Contig 3 showed moderate expression in skin and soft 
palate, and lowered expression levels in resting PBMC, large intestine, salivary gland, 
tonsil, pancreas, esophagus, and colon. Contig 11 (SEQ ID NO: 124) was found to be 
expressed in all head and neck squamous cell tumors tested (17/17), with high levels of 
expression being seen in 14/17 tumors, and moderately levels of expression being seen 

20 in 3/17 tumors. Additionally, high expression was seen in 3/12 lung squamous tumors 
and moderate expression in 4/12 lung squamous tumors. Contig 1 1 was negative for 3/4 
normal lung samples, with the remaining sample having only low expression. Contig 
1 1 showed low to moderate reactivity to salivary gland, soft palate, bladder, tonsil, skin, 
esophagus, and large intestine. Contig 13 (SEQ ID NO: 125) was found to be expressed 

25 in all head and neck squamous cell tumors tested (17/17), with high expression in 
12/17, and moderate expression in 5/17. Contig 13 was expressed in 7/12 lung 
squamous tumors, with high expression in 4/12 and moderate expression in three 
samples. Analysis of normal lung samples showed negative expression for 2/4 and low 
to moderate expression in the remaining two samples. Contig 13 showed low to 

30 moderate reactivity to resting PBMC, salivary gland, bladder, pancreas, tonsil, skin, 
esophagus, and large intestine, as well as high expression in soft palate. Subsequent 
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full-length cloning efforts revealed that contig 13 (also known as L761P) maps to the 3' 
untranslated region of the hSeclOp gene. The full-length sequence for this gene is set 
forth in SEQ ID NO: 368, and encodes the protein set forth in SEQ ID NO: 369. 

Contig 16 (SEQ ID NO: 127) was found to be moderately expressed in 
5 several head and neck squamous cell tumors (6/17) and one lung squamous tumor, 
while showing no expression in any normal lung samples tested. Contig 16 showed low 
reactivity to resting PBMC, large intestine, skin, salivary gland, and soft palate. Contig 
17 (SEQ ID NO: 128) was shown to be expressed in all head and neck squamous cell 
tumors tested (17/17) (highly expressed in 5/17, and moderately expressed in 12/17). 

10 Determination of expression levels in lung squamous tumors showed one tumor sample 
with high expression and 3/12 with moderate levels. Contig 17 was negative for 2/4 
normal lung samples, with the remaining samples having only low expression. 
Additionally, low level expression was found in esophagus and soft palate. Contig 19 
(SEQ ID NO: 129) was found to be expressed in most head and neck squamous cell 

15 tumors tested (11/17); with two samples having high expression levels, 6/17 showing 
moderate expression, and low expression being found in 3/17. Testing in lung 
squamous tumors revealed only moderate expression in 3/12 samples. Expression 
levels in 2/4 of normal lung samples were negative, the two other samples having only 
low expression. Contig 19 showed low expression levels in esophagus, resting PBMC, 

20 salivary gland, bladder, soft palate and pancreas. 

Contig 22 (SEQ ID NO: 131), was shown to be expressed in most head 
and neck squamous cell tumors tested (13/17) with high expression in four of these 
samples, moderate expression in 6/17, and low expression in 3/17. Expression levels in 
lung squamous tumors were found to be moderate to high for 3/12 tissues tested, with 

25 negative expression in two normal lung samples and low expression in two other 
samples (n=4). Contig 22 showed low expression in skin, salivary gland and soft 
palate. Similarly, Contig 24 (SEQ ID NO: 132) was found to be expressed in most head 
and neck squamous cell tumors tested (13/17) with high expression in three of these 
samples, moderate expression in 6/17, and low expression in 4/17. Expression levels in 

30 lung squamous tumors were found to be moderate to high for 3/12 tissues tested, with 
negative expression for three normal lung samples and low expression in one sample 
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(n=4). Contig 24 showed low expression in skin, salivary gland and soft palate. 
Contig 29 (SEQ ID NO: 133) was expressed in nearly all head and neck squamous cell 
tumors tested (16/17): highly expressed in 4/17, moderately expressed in 11/17, with 
low expression in one sample. Also, it was moderately expressed in 3/12 lung 
5 squamous tumors, while being negative for 2/4 normal lung samples. Contig 29 
showed low to moderate expression in large intestine, skin, salivary gland, pancreas, 
tonsil, heart and soft palate. Contig 47 (SEQ ID NO: 142) was expressed in most head 
and neck squamous cell tumors tested (12/17): moderate expression in 10/17, and low 
expression in two samples. In lung squamous tumors, it was highly expressed in one 
10 sample and moderately expressed in two others (n=13). Contig 47 was negative for 2/4 
normal lung samples, with the remaining two samples having moderate expression. 
Also, Contig 47 showed moderate expression in large intestine, and pancreas, and low 
expression in skin, salivary gland, soft palate, stomach, bladder, resting PBMC, and 
tonsil. 

15 Contig 48 (SEQ ID NO: 143) was expressed in all head and neck 

squamous cell tumors tested (17/17): highly expressed in 8/17 and moderately 
expressed in 7/17, with low expression in two samples. Expression levels in lung 
squamous tumors were high to moderate in three samples (n=13). Contig 48 was 
negative for one out of four normal lung samples, the remaining showing low or 

20 moderate expression. Contig 48 showed moderate expression in soft palate, large 
intestine, pancreas, and bladder, and low expression in esophagus, salivary gland, 
resting PBMC, and heart. Contig 49 (SEQ ID NO: 144) was expressed at low to 
moderate levels in 6/17 head and neck squamous cell tumors tested. Expression levels 
in lung squamous tumors were moderate in three samples (n=13). Contig 49 was 

25 negative for 2/4 normal lung samples, the remaining samples showing low expression. 
Moderate expression levels in skin, salivary gland, large intestine, pancreas, bladder and 
resting PBMC were shown, as well as low expression in soft palate, lymph nodes, and 
tonsil. Contig 56 (SEQ ID NO: 148) was expressed in low to moderate levels in 3/17 
head and neck squamous cell tumors tested, and in lung squamous tumors, showing low 

30 to moderate levels in three out of thirteen samples. Notably, low expression levels were 
detected in one adenocarcinoma lung tumor sample (n^). Contig 56 was negative for 
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3/4 normal lung samples, and showed moderate expression levels in only large intestine, 
and low expression in salivary gland, soft palate, pancreas, bladder, and resting PBMC. 
Contig 58, also known as L769P, (SEQ ID NO: 150) was expressed at moderate levels 
in 11/17 head and neck squamous cell tumors tested and low expression in one 
5 additional sample. Expression in lung squamous tumors showed low to moderate levels 
in three out of thirteen samples. Contig 58 was negative for 3/4 normal lung samples, 
with one sample having low expression. Moderate expression levels in skin, large 
intestine, and resting PBMC were demonstrated, as well as low expression in salivary 
gland, soft palate, pancreas, and bladder. Contig 59 (SEQ ID NO: 157) was expressed in 

10 some head, neck, and lung squamous tumors. Low level expression of Contig 59 was 
also detected in salivary gland and large intestine. 

The full-length cDNA sequence for Contig 22, also referred to as L763P, 
is provided in SEQ ID NO: 158, with the corresponding amino acid sequence being 
provided in SEQ ID NO: 159. Real-time RT-PCR analysis of L763P revealed that it is 

15 highly expressed in 3/4 lung squamous tumors as well as 4/4 head and neck squamous 
tumors, with low level expression being observed in normal brain, skin, soft pallet and 
trachea. Subsequent database searches revealed that the sequence of SEQ ID NO: 158 
contains a mutation, resulting in a frameshift in the corresponding protein sequence. A 
second cDNA sequence for L763P is provided in SEQ ID NO: 345, with the 

20 corresponding amino acid sequence being provided in SEQ ID NO: 346. The sequences 
of SEQ ID NO: 159 and 346 are identical with the exception of the C-terminal 33 amino 
acids of SEQ ID NO: 159. 

The full-length cDNA sequence incorporating Contigs 17, 19, and 24, 
referred to as L762P, is provided in SEQ ID NO: 160, with the corresponding amino 

25 acid sequence being provided in SEQ ID NO: 161. Further analysis of L762P has 
determined it to be a type I membrane protein and two additional variants have been 
sequenced. Variant 1 (SEQ ID NO: 167, with the corresponding amino acid sequence 
in SEQ ID NO: 169) is an alternatively spliced form of SEQ ID NO: 160 resulting in 
deletion of 503 nucleotides, as well as deletion of a short segment of the expressed 

30 protein. Variant 2 (SEQ ID NO: 168, with the corresponding amino acid sequence in 
SEQ ID NO: 170) has a two nucleotide deletion at the 3' coding region in comparison 
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to SEQ ID NO: 160, resulting in a secreted form of the expressed protein. Real-time 
RT-PCR analysis of L762P revealed that is over-expressed in 3/4 lung squamous 
tumors and 4/4 head & neck tumors, with low level expression being observed in 
normal skin, soft pallet and trachea. 
5 An epitope of L762P was identified as having the sequence 

KPGHWTYTLNNTHHSLQALK (SEQ ID NO: 382), which corresponds to amino 
acids 571-590 of SEQ IDNO:161. 

The full-length cDNA sequence for contig 56 (SEQ ID NO: 148), also 
referred to as L773P, is provided in SEQ ID NO: 171, with the amino acid sequence in 

10 SEQ ID NO: 172. L773P was found to be identical to dihydroxyl dehydrogenase at the 
3' portion of the gene, with divergent 5' sequence. As a result, the 69 N-terminal amino 
acids are unique. The cDNA sequence encoding the 69 N-terminal amino acids is 
provided in SEQ ID NO: 349, with the N-terminal amino acid sequence being provided 
in SEQ ID NO: 350. Real-time PCR revealed that L773P is highly expressed in lung 

15 squamous tumor and lung adenocarcinoma, with no detectable expression in normal 
tissues. Subsequent Northern blot analysis of L773P demonstrated that this transcript is 
differentially over-expressed in squamous tumors and detected at approximately 1.6 Kb 
in primary lung tumor tissue and approximately 1 .3 Kb in primary head and neck tumor 
tissue. 

20 Subsequent microarray analysis has shown Contig 58, also referred to as 

L769S (SEQ ID NO: 150), to be overexpressed in breast tumors in addition to lung 
squamous tumors. 

EXAMPLE 4 

Isolation and Characterization of Lung Tumor Polypeptides 
25 by PCR-Based Subtraction 

Seven hundred and sixty clones from a cDNA subtraction library, 
containing cDNA from a pool of two human lung primary adenocarcinomas subtracted 
against a pool of nine normal human tissue cDNAs including skin, colon, lung, 
esophagus, brain, kidney, spleen, pancreas and liver, (Clontech, Palo Alto, CA) were 
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derived and submitted to a first round of PCR amplification. This library (referred to as 
ALT-1) was subjected to a second round of PCR amplification, following the 
manufacturer's protocol. The expression levels of these 760 cDNA clones in lung 
tumor, normal lung, and various other normal and tumor tissues, were examined using 
5 microarray technology (Incyte, Palo Alto, CA). Briefly, the PCR amplification products 
were dotted onto slides in an array format, with each product occupying a unique 
location in the array. mRNA was extracted from the tissue sample to be tested, reverse 
transcribed, and fluorescent-labeled cDNA probes were generated. The microarrays 
were probed with the labeled cDNA probes, the slides scanned and fluorescence 

10 intensity was measured. This intensity correlates with the hybridization intensity.. A 
total of 118 clones, of which 55 were unique, were found to be over-expressed in lung 
tumor tissue, with expression in normal tissues tested (lung, skin, lymph node, colon, 
liver, pancreas, breast, heart, bone marrow, large intestine, kidney, stomach, brain, small 
intestine, bladder and salivary gland) being either undetectable, or at significantly lower 

15 levels. One of these clones, having the sequence as provided in SEQ ID NO:420 (clone 
#19014), shows homology to a previously identified clone, L773P. Clone L773P has 
the full-length cDNA sequence provided in SEQ ID NO: 171 and the amino acid 
sequence provided in SEQ ID NO: 172 The isolation of clone #19014 is also described 
in co-pending U.S. Patent application 09/285,479, filed April 2, 1999. 

20 EXAMPLE 5 

Synthesis of Polypeptides 

Polypeptides may be synthesized on a Perkin Elmer/ Applied Biosystems 
Division 43 OA peptide synthesizer using FMOC chemistry with HPTU (O- 
Benzotriazole-N,N,N',N'-tetramethyluronium hexafluorophosphate) activation. A Gly- 
25 Cys-Gly sequence may be attached to the amino terminus of the peptide to provide a 
method of conjugation, binding to an immobilized surface, or labeling of the peptide. 
Cleavage of the peptides from the solid support is carried out using the following 
cleavage mixture: trifluoroacetic acid:ethanedithiol:thioanisole:water:phenol 
(40:1 :2:2:3). After cleaving for 2 hours, the peptides are precipitated in cold methyl-t- 
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butyl-ether. The peptide pellets are then dissolved in water containing 0.1% 
trifluoroacetic acid (TFA) and lyophilized prior to purification by CI 8 reverse phase 
HPLC. A gradient of 0%-60% acetonitrile (containing 0.1% TFA) in water (containing 
0.1% TFA) may be used to elute the peptides. Following lyophilization of the pure 
5 fractions, the peptides are characterized using electrospray or other types of mass 
spectrometry and by amino acid analysis. 

EXAMPLE 6 

Preparation of Antibodies Against Lung Cancer Antigens 

Polyclonal antibodies against the lung cancer antigens L514S, L528S, 

10 L531S, L523 and L773P (SEQ ID NO: 155, 225, 1 12, 176 and 171, respectively) were 
prepared as follows. 

Rabbits were immunized with recombinant protein expressed in and 
purified from E. coli as described below. For the initial immunization, 400 jag of 
antigen combined with muramyl dipeptide (MDP) was injected subcutaneously (S.C.). 

15 Animals were boosted S.C. 4 weeks later with 200 ug of antigen mixed with incomplete 
Freund's Adjuvant (IF A). Subsequent boosts of 100 \xg of antigen mixed with IFA 
were injected S.C. as necessary to induce high antibody titer responses. Serum bleeds 
from immunized rabbits were tested for antigen-specific reactivity using ELISA assays 
with purified protein. Polyclonal antibodies against L514S, L528S, L531S, L523S and 

20 L773P were affinity purified from high titer polyclonal sera using purified protein 
attached to a solid support. 

Immunohistochemical analysis using polyclonal antibodies against 
L514S was performed on a panel of 5 lung tumor samples, 5 normal lung tissue samples 
and normal colon, kidney, liver, brain and bone marrow. Specifically, tissue samples 

25 were fixed in formalin solution for 24 hours and embedded in paraffin before being 
sliced into 10 micron sections. Tissue sections were permeabilized and incubated with 
antibody for 1 hr. HRP-labeled anti-mouse followed by incubation with DAB 
chromogen was used to visualize L514S immunoreactivity. L514S was found to be 
highly expressed in lung tumor tissue with little or no expression being observed in 
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normal lung, brain or bone marrow. Light staining was observed in colon (epithelial 
crypt cells positive) and kidney (tubules positive). Staining was seen in normal liver but 
no mRNA has been detected in this tissue making this result suspect. 

Using the same procedure, immunohistochemical analysis using 
5 polyclonal antibodies against L528S demonstrated staining in lung tumor and normal 
lung samples, light staining in colon and kidney, and no staining in liver and heart. 

Immunohistochemical analysis using polyclonal antibodies against 
L531S demonstrated staining in lung tumor samples, light membrane staining in most 
normal lung samples, epithelial staining in colon, tubule staining in kidney, ductal 
1 0 epithelial staining in liver and no staining in heart. 

Immunohistochemical analysis using polyclonal antibodies against 
L523S demonstrated staining in all lung cancer samples tested but no staining in normal 
lung, kidney, liver, colon, bone marrow or cerebellum. 

Generation of polyclonal anti-sera against L762P (SEQ ID NO: 169 and 
15 170) was performed as follows. 400 micrograms of lung antigen was combined with 
100 micrograms of muramyldipeptide (MDP). An equal volume of incomplete 
Freund's Adjuvant (IF A) was added and then mixed until an emulsion was formed. 
Rabbits were injected subcutaneously (S.C.). After four weeks the animals were 
injected S.C. with 200 micrograms of antigen mixed with an equal volume of IF A. 
20 Every four weeks animals were boosted with 100 micrograms of antigen. Seven days 
following each boost the animal was bled. Sera was generated by incubating the blood 
at 4°C for 12-24 hours followed by centrifugation. 

Characterization of polyclonal antisera was carried out as follows. 
Ninety-six well plates were coated with antigen by incubing with 50 microliters 
25 (typically 1 microgram) at 4°C for 20 hrs. 250 microliters of BSA blocking buffer was 
added to the wells and incubated at room temperature for 2 hrs. Plates were washed 6 
times with PBS/0.01% Tween. Rabbit sera was diluted in PBSand 50 microliters of 
diluted sera was added to each well and incubated at room temperature for 30 min. 
. Plates were washed as described above before addition of 50 microliters of goat anti- 
30 rabbit horse radish peroxidase (HRP) at a 1:10000 dilution and incubation at room 
temperature for 30 min. Plates were washed as described above and 100(4.1 of TMB 
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Microwell Peroxidase Substrate was added to each well. Following a 15 minute 
incubation in the dark at room temperature, the colorimetric reaction was stopped with 
lOOul IN H2SO4 and read immediately at 450 nm. Antisera showed strong reactivity to 
antigen L762P. 

5 Immunoliistochemical analysis using polyclonal antibodies against 

L762P demonstrated staining in all lung cancer samples tested, some light staining in 
the bronchiole epithelium of normal lung, tubule staining in kidney, light epithelial 
staining in colon and no staining in heart or liver. 

In order to evaluate L773P protein expression in various tissues, 

10 immunohistochemistry (IHC) analysis was performed using an affinity purified L773P 
polyclonal antibody. Briefly, tissue samples were fixed in formalin solution for 12-24 
hrs and embedded in paraffin before being sliced into 8 micron sections. Steam heat 
induced epitope retrieval (SHIER) in 0.1 M sodiuym citrate buffer (pH 6.0) was used 
for optimal staining conditions. Sections were incubated with 10% serum/PBS for 5 

15 minutes. Primary antibody was added to each section for 25 minutes at indicated 
concentrations followed by 25 minute incubation with either anti-rabbit or anti-mouse 
biotinylated antibody. Endogenous peroxidase activitiy was blocked by three 1 .5 minute 
incubations with hydrogen peroxidase. The avidin biotin complex/horse radish 
peroxidase (ABC/HRP) system was used along with DAB chromogen to visualize 

20 L773P expression. Slides were counterstainied with hematoxylin to visualize cell 
nuclei. Using this approach, L773P protein was detected in 6/8 lung tumors, 4/6 normal 
lung samples (very light staining in some cases), 1/1 kidney samples (very light 
staining), 0/1 heart samples, 1/1 colon samples (very light staining) and 0/1 liver 
samples. 

25 EXAMPLE 7 

Peptide Priming of Mice and Propagation of CTL Lines 

Immunogenic peptides from the lung cancer antigen L762P (SEQ ID 
NO: 161) for HLA-A2/K b -restricted CD8+ T cells were identified as follows. 
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The location of HLA-A2 binding peptides within the lung cancer antigen 
L762P (SEQ ID NO: 161) was predicted using a computer program which predicts 
peptides sequences likely to being to HLA-A*0201 by fitting to the known peptide 
binding motif for HLA-A*0201 (Rupert et al. (1993) Cell 74:929; Rammensee et al 
5 (1995) Immunogenetics 41: 178-228). A series of 19 synthetic peptides corresponding 
to a selected subset of the predicted HLA-A*0201 binding peptides was prepared as 
described above. 

Mice expressing the transgene for human HLA A2/K b (provided by Dr L. 
Sherman, The Scripps Research Institute, La Jolla, CA) were immunized with the 

10 synthetic peptides, as described by Theobald et al., Proc. Natl. Acad. Sci. USA 
92:1 1993-1 1997, 1995, with the following modifications. Mice were immunized with 
50pg of L726P peptide and 120ug of an I-A b binding peptide derived from hepatitis B 
virus protein emulsified in incomplete Freund's adjuvant. Three weeks later these mice 
were sacrificed and single cell suspensions prepared. Cells were then resuspended at 7 

15 x 10 6 cells/ml in complete media (RPMI-1640; Gibco BRL, Gaithersburg, MD) 
containing 10% FCS, 2mM Glutamine (Gibco BRL), sodium pyruvate (Gibco BRL), 
non-essential amino acids (Gibco BRL), 2 x 10" 5 M 2-mercaptoethanol, 50U/ml 
penicillin and streptomycin, and cultured in the presence of irradiated (3000 rads) 
L762P peptide- (5ug/ml) and lOmg/ml B 2 -microglobulin- (3 ug/ml) LPS blasts (A2 

20 transgenic spleens cells cultured in the presence of 7ug/ml dextran sulfate and 25ug/ml 
LPS for 3 days). After six days, cells (5 x 10 5 /ml) were restimulated with 2.5 x 10 6 /ml 
peptide-pulsed irradiated (20,000 rads) EL4A2Kb cells (Sherman et al, Science 
255:815-818, 1992) and 5 x 10 6 /ml irradiated (3000 rads) A2/K b -transgenic spleen 
feeder cells. Cells were cultured in the presence of lOU/ml IL-2. Cells were 

25 restimulated on a weekly basis as described, in preparation for cloning the line. 

Peptide-specific cell lines were cloned by limiting dilution analysis with 
irradiated (20,000 rads) L762P peptide-pulsed EL4 A2Kb tumor cells (1 x 10 4 
cells/well) as stimulators and irradiated (3000 rads) A2/K b -transgenic spleen cells as 
feeders (5 x 10 5 cells/ well) grown in the presence of lOU/ml IL-2. On day 7, cells were 

30 restimulated as before. On day 14, clones that were growing were isolated and 
maintained in culture. 
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Cell lines specific for the peptides L762P-87 (SEQ ID NO: 226; 
corresponding to amino acids 87-95 of SEQ ID NO: 161), L762P-145 (SEQ ID NO: 
227; corresponding to amino acids 145-153 of SEQ ID NO: 161), L762P-585 (SEQ ID 
NO: 228; corresponding to amino acids 585-593 of SEQ ID NO: 161), L762P-425 
5 (SEQ ID NO: 229; corresponding to amino acids 425-433 of SEQ ID NO: 161), 
L762P(10)-424 (SEQ ID NO: 230; corresponding to amino acids 424-433 of SEQ ID 
NO: 161) and L762P(10)-458 (SEQ ID NO: 231; corresponding to amino acids 458-467 
of SEQ ID NO: 161) demonstrated significantly higher reactivity (as measured by 
percent specific lysis) against L762P peptide-pulsed EL4-A2/K b tumor target cells than 
1 0 control peptide-pulsed EL4- A2/K b tumor target cells . 

EXAMPLE 8 

Identification of CD4 Immunogenic T Cell Epitopes Derived 
From the Lung Cancer Antigen L762p 

CD4 T cell lines specific for the antigen L762P (SEQ ID NO: 161) were 

1 5 generated as follows. 

A series of 28 overlapping peptides were synthesized that spanned 
approximately 50% of the L762P sequence. For priming, peptides were combined into 
pools of 4-5 peptides, pulsed at 20 micrograms/ml into dendritic cells for 24 hours. The 
dendritic cells were then washed and mixed with positively selected CD4+ T cells in 96 

20 well U-bottomed plates. Forty cultures were generated for each peptide pool. Cultures 
were restimulated weekly with fresh dendritic cells loaded with peptide pools. 
Following a total of 3 stimulation cycles, cells were rested for an additional week and 
tested for specificity to antigen presenting cells (APC) pulsed with peptide pools using 
interferon-gamma ELISA and proliferation assays. For these assays, adherent 

25 monocytes loaded with either the relevant peptide pool or an irrelevant peptide were 
used as APC. T cell lines that appeared to specifically recognize L762P peptide pools 
both by cytokine release and proliferation were identified for each pool. Emphasis was 
placed on identifying T cells with proliferative responses. T cell lines that demonstrated 
either both L762P-specific cytokine secretion and proliferation, or strong proliferation 
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alone were further expanded to be tested for recognition of individual peptides from the 
pools, as well as for recognition of recombinant L762P. The source of recombinant 
L762P was E. coli, and the material was partially purified and endotoxin positive. 
These studies employed 10 micrograms of individual peptides, 10 or 2 micrograms of 
5 an irrelevant peptide, and 2 or 0.5 micrograms of either L762P protein or an irrelevant, 
equally impure, E. coli generated recombinant protein. Significant interferon-gamrna 
production and CD4 T cell proliferation was induced by a number of L762P-derived 
peptides in each pool. The amino acid sequences for these peptides are provided in 
SEQ ID NO: 232-251. These peptides correspond to amino acids 661-680, 676-696, 

10 526-545, 874-893, 811-830, 871-891, 856-875, 826-845, 795-815, 736-755, 706-725, 
706-725, 691-710, 601-620, 571-590, 556-575, 616-635, 646-665, 631-650, 541-560 
and 586-605, respectively, of SEQ ID NO: 161. 

CD4 T cell lines that demonstrated specificity for individual L762P- 
derived peptides were further expanded by stimulation with the relevant peptide at 10 

15 micrograms/ml. Two weeks post-stimulation, T cell lines were tested using both 
proliferation and IFN-gamrna ELISA assays for recognition of the specific peptide. A 
number of previously identified T cells continued to demonstrate L762P-peptide 
specific activity. Each of these lines was further expanded on the relevant peptide and, 
following two weeks of expansion, tested for specific recognition of the L762P-peptide 

20 in titration experiments, as well as for recognition of recombinant E. co/z-derived L762P 
protein. For these experiments, autologous adherent monocytes were pulsed with either 
the relevant L762P-derived peptide, an irrelevant mammaglobin-derived peptide, 
recombinant E. co/z-derived L762P (approx. 50% pure), or an irrelevant E. co/z-derived 
protein. The majority of T cell lines were found to show low affinity for the relevant 

25 peptide, since specific proliferation and IFN-gamma ratios dramatically decreased as 
L762P peptide was diluted. However, four lines were identified that demonstrated 
significant activity even at 0.1 micrograms/ml peptide. Each of these lines (referred to 
as A/D5, D/F5, E/A7 and E/B6) also appeared to specifically proliferate in response to 
the E. co/z'-derived L762P protein preparation, but not in response to the irrelevant 

30 protein preparation. The amino acid sequences of the L762P-derived peptides 
recognized by these lines are provided in SEQ ID NO: 234, 249, 236 and 245, 
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respectively. No protein specific IFN-gamma was detected for any of the lines. Lines 
A/D5, E/A7 and E/B6 were cloned on autologous adherent monocytes pulsed with the 
relevant peptide at 0.1 (A/D5 and E/A7) or 1 (D/F5) microgram/ml. Following growth, 
clones were tested for specificity for the relevant peptide. Numerous clones specific for 
5 the relevant peptide were identified for lines A/D5 and E/A7. 

EXAMPLE 9 

Protein Expression of Lung Tumor-Specific Antigens 

a) Expression of L514S inE. coli 

The lung tumor antigen L514S (SEQ ID NO: 89) was subcloned into the 
10 expression vector pE32b at Ncol and NotI sites, and transformed into E. coli using 
standard techniques. The protein was expressed from residues 3-153 of SEQ ID NO: 
89. The expressed amino acid sequence and the corresponding DNA sequence are 
provided in SEQ ID NO: 252 and 253, respectively. 

b) Expression of L762P 

1 5 Amino acids 32-944 of the lung tumor antigen L762P (SEQ ID NO: 161), with a 6X His 
Tag, were subcloned into a modified pET28 expression vector, using kanamycin 
resistance, and transformed into BL21 CodonPlus using standard techniques. Low to 
moderate levels of expression were observed. The determined DNA sequence of the 
L762P expression construct is provided in SEQ ID NO: 254. 

20 EXAMPLE 10 

Identification of MHC class II restricting Allele for L762P Peptide-Specific 
Responses 

A panel of HLA mismatched antigen presenting cells (APC) were used 
to identify the MHC class II restricting allele for the L762P-peptide specific responses 
25 of CD4 T cell clones derived from lines that recognized L762P peptide and recombinant 
protein. Clones from two lines, AD-5 and EA-7, were tested as described below. The 



133 



WO 02/47534 



PCT7US01/47576 



AD-5 derived clones were found to be restricted by the HLA-DRB-1 101 allele, and an 
EA-7 derived clone was found to be restricted by the HLA DRB-0701 or DQB 1-0202 
allele. Identification of the restriction allele allows targeting of vaccine therapies using 
the defined peptide to individuals that express the relevant class II allele. Knowing the 
5 relevant restricting allele will also enable clinical monitoring for responses to the 
defined peptide since only individuals that express the relevant allele will be monitored. 

CD4 T cell clones derived from line AD-5 and EA-7 were stimulated on 
autologous APC pulsed with the specific peptide at 10 |ig/ml, and tested for recognition 
of autologous APC (from donor D72) as well as against a panel of APC partially 

10 matched with D72 at class II alleles. Table 2 shows the HLA class typing of the APC 
tested. Adherent monocytes (generated by 2 hour adherence) from four different 
donors, referred to as D45, D187, D208, and D326, were used as APC in these 
experiments. Autologous APC were not included in the experiment. Each of the APC 
were pulsed with the relevant peptide (5a for AD-5 and 3e for 3A-7) or the irrelevant 

15 mammoglobin peptide at 10 \ig/ml, and cultures were established for 10,000 T cells and 
about 20,000 APC/well. As shown in Table 3, specific proliferation and cytokine 
production could be detected only when partially matched donor cells were used as 
APC. Based on the MHC typing analysis, these results strongly suggest that the 
restricting allele for the L762-specific response of the AD-5 derived clones is HLA- 

20 DRB-1101 and for the EA-7 derived clone the restricting allele is HLA DRB-0701 or 
DQB 1-0202. 



Table 2 - HLA Typing of APC 
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EXAMPLE 11 

Fusion Proteins ofN-Terminal and C-Terminal Portions of L763P 

In another embodiment, a Mycobacterium tuberculosis-derived 
polynucleotide, referred to as Ral2, is linked to at least an immunogenic portion of a 
polynucleotide of this invention. Ral2 compositions and methods for their use in 
enhancing expression of heterologous polynucleotide sequences are described in U.S. 
Patent Application 60/158,585, the disclosure of which is incorporated herein by 
reference in its entirety. Briefly, Ral2 refers to a polynucleotide region that is a 
subsequence of a Mycobacterium tuberculosis MTB32A nucleic acid. MTB32A is a 
serine protease of 32 KD molecular weight encoded by a gene in virulent and avirulent 
strains of M. tuberculosis. The nucleotide sequence and amino acid sequence of 
MTB32A have been described (for example, U.S. Patent Application 60/158,585; see 
also, Skeiky et ah, Infection and Immun. (1999) 67:3998-4007, incorporated herein by 
reference). Surprisingly, it was discovered that a 14 KD C-terminal fragment of the 
MTB32A coding sequence expresses at high levels on its own and remains as a soluble 
protein throughout the purification process. Moreover, this fragment may enhance the 
immunogenicity of heterologous antigenic polypeptides with which it is fused. This 14 
KD C-terminal fragment of the MTB32A is referred to herein as Ral2 and represents a 
fragment comprising some or all of amino acid residues 192 to 323 of MTB32A. 

Recombinant nucleic acids which encode a fusion polypeptide 
comprising a Ral2 polypeptide and a heterologous lung tumor polypeptide of interest, 
can be readily constructed by conventional genetic engineering techniques. 
Recombinant nucleic acids are constructed so that, preferably, a Ral2 polynucleotide 
sequence is located 5' to a selected heterologous lung tumor polynucleotide sequence. 
It may also be appropriate to place a Ral2 polynucleotide sequence 3' to a selected 
heterologous polynucleotide sequence or to insert a heterologous polynucleotide 
sequence into a site within a Ral2 polynucleotide sequence. 

In addition, any suitable polynucleotide that encodes a Ra 12 or a portion 
or other variant thereof can be used in constructing recombinant fusion polynucleotides 
comprising Ral2 and one or more lung tumor polynucleotides disclosed herein. 
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Preferred Ral2 polynucleotides generally comprise at least about 15 consecutive 
nucleotides, at least about 30 nucleotides, at least about 60 nucleotides, at least about 
100 nucleotides, at least about 200 nucleotides, or at least about 300 nucleotides that 
encode a portion of a Ral2 polypeptide. 

Ral2 polynucleotides may comprise a native sequence (i.e., an 
endogenous sequence that encodes a Ral2 polypeptide or a portion thereof) or may 
comprise a variant of such a sequence. Ral2 polynucleotide variants may contain one 
or more substitutions, additions, deletions and/or insertions such that the biological 
activity of the encoded fusion polypeptide is not substantially diminished, relative to a 
fusion polypeptide comprising a native Ral2 polypeptide. Variants preferably exhibit at 
least about 70% identity, more preferably at least about 80% identity and most 
preferably at least about 90% identity to a polynucleotide sequence that encodes a native 
Ral2 polypeptide or a portion thereof. 

Two specific embodiments of fusions between Ral2 and antigens of the 
present invention are described in this example. 

A. N-Terminal Portion of L763P 

A fusion protein of full-length Ral2 and the N-terminal portion of L763P (referred to as 
L763P-N; amino acid residues 1-130 of SEQ ID NO: 159) was expressed as a single 
recombinant protein in E. coli. The cDNA for the N-terminal portion was obtained by 
PGR with a cDNA for the full length L763P and primers L763F3 (5' 
CGGCGAATTCATGGATTGGGGGACGCTGC; SEQ ID NO: 383) and 1763RV3 (5' 
CGGCCTCGAGTCACCCCTCTATCCGAACCTTCTGC; SEQ ID NO: 384). The 
PGR product with expected size was recovered from agarose gel, digested with 
restriction enzymes EcoRI and Xhol, and cloned into the corresponding sites in the 
expression vector pCRXl. The sequence for the fusion of full-length of Ral2 and 
L763P-N was confirmed by DNA sequencing. The determined cDNA sequence is 
provided in SEQ ID NO:351, with the corresponding amino acid sequence being 
provided in SEQ ID NO: 352). 
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B. C-Terminal Portion of L763P 

A fusion protein of full-length Ral2 and the C-terminal portion of L763P 
(referred to as L763P-C; amino acid residues 100-262 of SEQ ID NO: 159) was 
expressed as a single recombinant protein in E. coli. The cDNA of the C-terminal 
portion of L763P was obtained by PCR with a cDNA for the full length of L763P and 
primers L763F4 (5' CGGCGAATTCCACGAACCACTCGCAAGTTCAG; SEQ ID 
NO: 385) and L763RV4 (5' CGGCTCGAG-TTAGCTTGGGCCTGTGATTGC; SEQ 
ID NO: 386). The PCR product with expected size was recovered from agarose gel, 
digested with restriction enzymes EcoRI and Xhol, and cloned into the corresponding 
sites in the expression vector pCRXl. The sequence for the fusion of full-length Ral2 
and L763P-C was confirmed by DNA sequencing. The determined DNA sequence is 
provided in SEQ ID NO:353, with the corresponding amino acid sequence being 
provided in SEQ ID NO: 354. 

The recombinant proteins described in this example are useful for the 
preparation of vaccines, for antibody therapeutics, and for diagnosis of lung tumors. 

EXAMPLE 12 

Expression in E. Coli of L762P His Tag Fusion Protein 

PCR was performed on the L762P coding region with the following 

primers: 

Forward primer starting at amino acid 32. 

PDM-278 5 'ggagtacagcttcaagacaatggg 3 ' (SEQ ID NO:355) Tm 57°C. 
Reverse primer including natural stop codon after amino acid 920, 
creating EcoRI site 

PDM-280 5'ccatgggaattcattataataattttgttcc 3' (SEQ ID NO:356) 

TM55°C. 

The PCR product was digested with EcoRI restriction enzyme, gel 
purified and then cloned into pPDM His, a modified pET28 vector with a His tag in 
frame, which had been digested with Eco72I and EcoRI restriction enzymes. The 
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correct construct was confirmed by DNA sequence analysis and then transformed into 
BL21 (DE3) pLys S and BL21 (DE3) CodonPlus RIL expression hosts. 

The protein sequence of expressed recombinant L762P is shown in SEQ 
ID NO:357, and the DNA sequence is shown in SEQ ID NO:358. 

EXAMPLE 13 

Expression in E. Coli of a L773PA His Tag Fusion Protein 

The L773PA coding region (encoding amino acids 2-71 of SEQ ID NO: 
172) was PCR amplified using the following primers: 

Forward primer for L773PA starting at amino acid 2: 

PDM-299 5'tggcagcccctcttcttcaagtggc 3' (SEQ ID NO:359) Tm63°C. 

Reverse primer for L773PA creating artificial stop codon after amino 

acid 70: 

PDM-355 5'cgccagaatteatcaaacaaatctgttagcacc 3' (SEQ ID NO:360) 

Tm62°C. 

The resulting PCR product was digested with EcoRI restriction enzyme, 
gel purified and then cloned into pPDM His, a modified pET28 vector with a His tag in 
frame, which had been digested with Eco72I and EcoRI restriction enzymes. The 
correct construct was confirmed by DNA sequence analysis and transformed into BL21 
(DE3) pLys S and BL21 (DE3) CodonPlus RIL expression hosts. 

The protein sequence of expressed recombinant L773PA is shown in 
SEQ ID NO:361, and the DNA sequence is shown in SEQ ID NO:362. 

EXAMPLE 14 

Identification of Epitopes Derived From Lung Tumor Specific Polypeptides 

A series of peptides from the L773P amino acid sequence (SEQ ID NO: 
172) were synthesized and used in in vitro priming experiments to generate peptide- 
specific CD4 T cells. These peptides were 20-mers that overlapped by 15 amino acids 
and corresponded to amino acids 1-69 of the L773P protein. This region has been 
demonstrated to be tumor-specific. Following three in vitro stimulations, CD4 T cell 
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lines were identified that produced IFNy in response to the stimulating peptide but not 
the control peptide. Some of these T cell lines demonstrated recognition of recombinant 
L773P and L773PA (tumor-specific region) proteins. 

To perform the experiments, a total of eleven 20-mer peptides (SEQ ID 
NO: 363, 365 and 387-395) overlapping by 15 amino acids and derived from the N- 
terminal tumor-specific region of L773P (corresponding to amino acids 1-69 of SEQ ID 
NO: 172) were generated by standard procedures. Dendritic cells were derived from 
PBMC of a normal donor using GMCSF and IL-4 by standard protocol. Purified CD4 T 
cells were generated from the same donor as the dendritic cells using MACS beads and 
negative selection of PBMCs. Dendritic cells were pulsed overnight with the individual 
20-mer peptides at a concentration of 10 ug/ml. Pulsed dendritic cells were washed and 
plated at 1 x 10 4 /well of a 96-well U-bottom plates, and purified CD4 cells were added 
at 1 x 10 5 well. Cultures were supplemented with 10 ng/ml IL-6 and 5 ng/ml IL-12, and 
incubated at 37°C. Cultures were re-stimulated as above on a weekly basis using as 
APC dendritic cells generated and pulsed as above, supplemented with 5 ng/ml IL-7 and 
10 |ig/ml IL-2. Following 3 in vitro stimulation cycles, cell lines (each corresponding to 
one well) were tested for cytokine production in response to the stimulating peptide vs. 
an irrelevant peptide. 

A small number of individual CD4 T cell lines (9/528) demonstrated 
cytokine release (IFNy) in response to the stimulating peptide but not to control peptide. 
The CD4 T cell lines that demonstrated specific activity were restimulated on the 
appropriate L773P peptide and reassayed using autologous dendritic cells pulsed with 
10 ug/ml of the appropriate L773P peptide, an irrelevant control peptide, recombinant 
L773P protein (amino acids 2-364, made in E. coif), recombinant L773PA (amino acids 
2-71, made in E. coif), or an appropriate control protein (L3E, made in E. coif). Three 
of the nine lines tested (1-3C, 1-6G, and 4-12B) recognized the appropriate L773P 
peptide as well as recombinant L773P and L773PA. Four of the lines tested (4-8A, 4- 
8E, 4-12D, and 4-12E) recognized the appropriate L773P peptide only. Two of the 
lines tested (5-6F and 9-3B) demonstrated non-specific activity. 

These results demonstrate that the peptide sequences 
MWQPLFFKWLLSCCPGSSQI (amino acids 1-20 of SEQ ID NO: 172; SEQ ID 
NO:363) and GSSQIAAAASTQPEDDINTQ (amino acids 16-35 of SEQ ID NO: 172; 



140 



WO 02/47534 



PCT7US01/47576 



SEQ ID NO: 365) may represent naturally processed epitopes of L773P, which are 
capable of stimulating human class II MHC-restricted CD4 T cell responses. 

In subsequent studies, the above epitope mapping experiment was 
repeated using a different donor. Again, some of the resulting T cell lines were found to 
respond to peptide and recombinant protein. An additional peptide was found to be 
naturally processed. Specifically, purified CD4 cells were stimulated on a total of 
eleven 20-mer peptides overlapping by 15 amino acids (SEQ ID NO: 363, 387, 388, 365 
and 389-395, respectively). The priming was carried out as described above, except that 
a peptide concentration of 0.5 ug/mL rather than 10 ug/mL was employed. In the initial 
screen of the cell lines 9 of the 528 lines released at least a three-fold greater level of 
IFN-gamma with stimulating peptide vs. control peptide. These 9 lines were 
restimulated on the appropriate peptide and then tested on dendritic cells pulsed with a 
titration of appropriate peptide (10 ug/mL, 1 ug/mL and 0.1 ug/mL), and 10 ug/mL of a 
control peptide. Six of the 9 lines recognized recombinant L773P as well as peptide. 
The six lines referred to as 1-1E, 1-2E, 1-4H, 1-6A, 1-6G and 2-12B recognized 
L773PA and the appropriate peptide. These results demonstrate that the peptides of 
SEQ ID NO: 363 and 387 represent naturally processed epitopes of L773P. 

Using the procedures described above, CD4+ T cell responses were 
generated from PBMC of normal donors using dendritic cells pulsed with overlapping 
20-mer peptides (SEQ ID NO: 396-419) spanning the L523S polypeptide sequence 
(SEQ ID NO: 176). A number of CD4+ T cells demonstrated reactivity with the 
priming peptides as well as with L523S recombinant protein, with the dominant 
reactivity of these lines being within the peptides 4, 7 and 21 (SEQ ID NO: 399, 402 
and 416; corresponding to amino acids 30-39, 60-79 and 200-219, respectively, of SEQ 
ID NO: 176). 

Epitopes within the scope of the invention include epitopes restricted by 
other class II MHC molecules. In addition, variants of the peptide can be produced 
wherein one or more amino acids are altered such that there is no effect on the ability of 
the peptides to bind to MHC molecules, no effect on their ability to elicit T cell 
responses, and no effect on the ability of the elicited T cells to recognize recombinant 
protein. 



141 



WO 02/47534 



PCT7US01/47576 



EXAMPLE 15 

Surface Expression of L762P AND Antibody Epitopes Thereof 

Rabbits were immunized with full-length histidine-tagged L762P protein 
generated in E. coli. Sera was isolated from rabbits and screened for specific 
recognition of L762P in ELISA assays. One polyclonal serum, referred to as 2692L, was 
identified that specifically recognized recombinant L762P protein. The 2692L anti- 
L762P polyclonal antibodies were purified from the serum by affinity purification using 
L762P affinity columns. Although L762P is expressed in a subset of primary lung tumor 
samples, expression appears to be lost in established lung tumor cell lines. Therefore, 
to characterize surface expression of L762P, a retrovirus construct that expresses L762P 
was used to transduce primary human fibroblasts as well as 3 lung tumor cell lines (522- 
23, HTB, and 343T). Transduced lines were selected and expanded to examine L762P 
surface expression by FACS analysis. For this analysis, non-transduced and transduced 
cells were harvested using cell dissociation medium, and incubated with 10-50 
micrograms/ml of either affinity purified anti-L762P or irrelevant antisera. Following a 
30 minute incubation on ice, cells were washed and incubated with a secondary, FITC 
conjugated, anti rabbit IgG antibody as above. Cells were washed, resuspended in buffer 
with Propidium Iodide (PI) and examined by FACS using an Excalibur fluorescence 
activated cell sorter. For FACS analysis, Pi-positive (i.e. dead/permeabilized cells) were 
excluded. The polyclonal anti-L762P sera specifically recognized and bound to the 
surface of L762P-transduced cells but not the non-transduced counterparts. These 
results demonstrate that L762P is localized to the cell surface of both fibroblasts as well 
as lung tumor cells. 

To identify the peptide epitopes recognized by 2692L, an epitope 
mapping approach was pursued. A series of overlapping 19-21 mers (5 amino acid 
overlap) was synthesized that spanned the C terminal portion of L762P (amino acids 
481-894 of SEQ ID NO: 161). In an initial experiment peptides were tested in pools. 
Specific reactivity with the L762P antiserum was observed with pools A, B, C, and E. 
To identify the specific peptides recognized by the antiserum, flat bottom 96 well 
microtiter plates were coated with individual peptides at 1 0 microgram/ml for 2 hours at 
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37 °C. Wells were then aspirated and blocked with phosphate buffered saline containing 
5% (w/v) milk for 2 hours at 37 °C, and subsequently washed in PBS containing 0.1% 
Tween 20 (PBST). Purified rabbit anti-L762P serum 2692L was added at 200 or 20 
ng/well to triplicate wells in PBST and incubated overnight at room temperature. This 
was followed by washing 6 times with PBST and subsequently incubating with HRP- 
conjugated donkey anti rabbit IgG (H+L)Affinipure F(ab') fragment at 1:2,000 for 60 
minutes. Plates were then washed, and incubated in tetramethyl benzidine substrate. 
Reactions were stopped by the addition of IN sulfuric acid and plates were read at 
450/570 nm using an ELISA plate reader. 

The resulting data, presented in Table 4 below, demonstrates that the 
L762P antisera recognized at least 6 distinct peptide epitopes from the 3' half of L762P. 
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Table 4 



ELISA activity (OD 450-570 



Peptide (starting amino 
acid of L762P) 


pool 


200 ng polyclonal 
serum 


20 ng polyclonal 
serum 


A (481) 


A f 


1.76 


1 0 


B (495) 


A 


0.14 


.06 


C(511) 


: E 


0.47 


0.18 


D (526) 


E 


0.11 


0.00 


E (541) 


A 


0.11 


0.04 


F (556) 


A 


0.04 


0.02 


G(571) 


A 


0.06 


0.02 


H (586) 


B 


0.1 


0.03 


1(601) 


B 


0.25 


0.06 


J (616) 


B 


0.1 


0.03 


K(631) 


E 


0.1 


0.08 


L (646) 


B 


0.28 


0.12 


M (661) 


B 


0.14 


0.03 


N (676) 


C 


0.12 


0.1 


0 (691) 


c 


1.1 


i) >3 


P (706) 


c 


0.1 


0.03 


Q(721) 


c 


0.11 


0.05 


R(736) 


E 


0.12 


0.04 


S (751) 


C 


0.15 


0.06 


U(781) 


D 


0.12 


0.06 


V (795) 


F 


0.07 


0.05 


X (826) 


D 


0.1 


0.03 


Y (841) 


D 


0.17 


0.07 


Z (856) 


D 


0.16 


0.08 


AA(871) 


F 


0.17 


0.05 


BB (874) 


F 


0.14 


' 0.11 


No peptide 




0.15 


0.045 



Individual peptides were identified from each of the pools, and 
additionally a weak reactivity was identified with peptide BB from pool F. The relevant 
peptide epitopes are summarized in the Table 5 below The amino acid sequences for 
peptides BB, O, L, I, A and C are provided in SEQ ID NO: 376-381, respectively, with 
the corresponding cDNA sequences being provided in SEQ ID NO: 373, 370, 372, 374, 
371 and 375, respectively. 
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Table 5 
ELISA activity 
(OD 450-570) 



Peptide 


Nucleotides 
ofL762P 


Amino 
acids of 
L762P 


Sequence 


pool 


200 ng 


20 ng 


A 


1441-1500 


481-500 


SRISSGTGDIFQQHIQLEST 


A 


1.76 


1.0 


C 


1531-1590 


511-530 


KNTVTVDNTVGNDTMFLVTW 


E 


0.47 


0.18 


I 


1801-1860 


601-620 


AVPPATVEAFVERDSLHFPH 


B 


0.25 


0.06 


L 


1936-1955 


646-665 


PETGDPVTLRLLDDGAGADV 


B 


0.28 


0.12 


0 


2071-2130 


691-710 


VNHSPSISTPAHSIPGSHAM1L 


C 


1.1 


0.23 


BB 


2620-2679 


874-893 


LQSAVSNIAQAPLFIPPNSD 


F 


0.14 


0.11 


None 










0.15 


0.05 



EXAMPLE 16 

Detection of Antibodies Against Lung Tumor Antigens in Patient Sera 

Antibodies specific for the lung tumor antigens L773PA (SEQ ID 
NO:361), L514S (SEQ ID NO:155 and 156), L523S (SEQ ID NO:176), L762P (SEQ ID 
NO:161) and L763P (SEQ ID NO:159) were shown to be present in effusion fluid or 
sera of lung cancer patients but not in normal donors. More specifically, the presence of 
antibodies against L773PA, L514S, L523S, L762P and L763P in effusion fluid obtained 
from lung cancer patients and in sera from normal donors was detected by ELISA using 
recombinant proteins and HRP-conjugated anti-human Ig. Briefly, each protein (100 
ng) was coated in 96-well plate at pH 9.5. In parallel, BSA (bovine serum albumin) was 
also coated as a control protein. The signals ([S], absorbance measured at 405 ran) 
against BSA ([N]) were determined. The results of these studies are shown in Table 6, 
wherein - represents [S]/[N] < 2; +/- represents [S]/[N] >2; ++ represents [Sj/[N] >3; 
and +++ represents [S]/[N] >5. 
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Detection of Antibodies Against Lung Tumor Antigens 





L514S 


L523S 


L762P 


L763P 


L773PA 


Effusion fluid 












#1 


+++ 


++ 


++ 


- 


++ 


#2 


- 


- 


+/- 


++ 


+/- 


#3 


- 


- 


- 


- 


+/- 


#4 


+/- 


++ 


+/- 


- 


+/- 


#5 


+/- 


+++ 


+/- 


+/- 


++ 


#7 


- 


+/- 




- 


+/- 


#8 


- 


+++ 


- 


- 


++ 


#10 


- 


++ 


+/- 


+/- 


- 


#11 


+/- 


++ 


++ 


- 


++ 


#12 


+++ 


+/- 


- 


+/- 


+/- 


#13 


- 


+/- 


- 


- 


+/- 


#14 


- 


+++ 


+/- 


+/- 


++ 


#15 


+/- 


++ 


+/- 


- 


++ 


#17 


- 


+/- 


- 


- 


+/- 


#18 


- 


++ 


- 


- 


- 


#19 




+/- 






+/- 


#20 


L +/ - 


+/- 


+/- 




+A 


Normal sera 












#21 




+/- 








#22 ' 












#23 










+/- 


#24 




+/- 








#25 


+/- 


+/- 






+/- 



Using Western blot analyses, antibodies against L523S were found to be 
present in 3 out of 4 samples of effusion fluid from lung cancer patients, with no L523S 
antibodies being detected in the three samples of normal sera tested. 



EXAMPLE 17 

Expression in K Coli of a L5 14S His Tag Fusion Protein 

PCR was performed on the L514S-13160 coding region with the 
following primers: 

Forward primer PDM-278 5' cacactagtgtccgcgtggcggcctac 3' (SEQ ID 
NO:421) Tm 67°C. 
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Reverse primer PDM-280 5' catgagaattcatcacatgcccttgaaggctccc 3' 
(SEQ ID NO:422) TM 66°C. 

The PGR conditions were as follows: 

IOjliI 1 OX Pm buffer 

l.Oul lOmMdNTPs 

2.0ul 1 OuM each primer 

83ul sterile water 

1.5ul Ptu DNA polymerase (Stratagene, La Jolla, CA) 
50rig DNA 

96°C for 2 minutes, 96°C for 20 seconds, 66°C for 15 seconds, 72°C for 
1 minute with 40 cycles and then 72°C for 4 minutes. 

The PCR product was digested with EcoRI restriction enzyme, gel 
purified and then cloned into pPDM His, a modified pET28 vector with a His tag in 
frame, which had been digested with Eco72I and EcoRI restriction enzymes. The 
correct construct was confirmed by DNA sequence analysis and then transformed into 
BL21 CodonPlus (Stratagene, La Jolla, CA) cells for expression. 

The amino acid sequence of expressed recombinant L514S is shown in 
SEQ ID NO:423, and the DNA coding region sequence is shown in SEQ ID NO:424. 

EXAMPLE 18 

Expression in E. Coli of a L523S His Tag Fusion Protein 
PCR was performed on the L523S coding region with the following 

primers: 

Forward primer PDM-4 14 5' aacaaactgtatatcggaaacctcagcgagaa 3' (SEQ 
ID NO:425) Tm 62°C. 

Reverse primer PDM-415 5' ccatagaattcattacttccgtcttgactgagg 3' (SEQ 
ID NO:426) TM 62°C. 

The PCR conditions were as follows: 
lOul 1 OX Pfo buffer 
l.Oul lOmMdNTPs 
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2.0ul 10uM each primer 
83jj,1 sterile water 

1.5ul Pfu DNA polymerase (Stratagene, La Jolla, CA) 
50r|g DNA 

96°C for 2 minutes, 96°C for 20 seconds, 62°C for 15 seconds, 72°C for 
4 minutes with 40 cycles and then 72°C for 4 minutes. 

The PCR product was digested with EcoRI restriction enzyme, gel 
purified and then cloned into pPDM His, a modified pET28 vector with a His tag in 
frame, which had been digested with Eco72I and EcoRI restriction enzymes. The 
correct construct was confirmed by DNA sequence analysis and then transformed into 
BL21 CodonPlus (Stratagene, La Jolla, CA) cells for expression. 

The amino acid sequence of expressed recombinant L523S is shown in 
SEQ ID NO:427, and the DNA coding region sequence is shown in SEQ ID NO:428. 

EXAMPLE 19 

Expression in k Coli of a L762PA His Tag Fusion Protein 

PCR was performed on the L762PA coding region (L762PA is missing 
the signal sequence, the C-terminal transmembrane domain and the cytoplasmic tail) 
with the following primers: 

Forward primer PDM-278 5'ggagtacagcttcaagacaatggg 3' (SEQ ID 
NO:355) Tm 57°C. 

Reverse primer PDM-279 5'ccatggaattcattatttcaatataagataatctc 3' (SEQ 
IDNO:429)TM56°C. 

The PCR conditions were as follows: 
lOullOX Pfu buffer 
I.OjjlI lOmMdNTPs 
2.0(4,1 lOuM each primer 
83 ul sterile water 

1 .5ul Pfu DNA polymerase (Stratagene, La Jolla, CA) 
50qg DNA 
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96°C for 2 minutes, 96°C for 20 seconds, 55°C for 15 seconds, 72°C for 
5 minutes with 40 cycles and then 72°C for 4 minutes. 

The PCR product was digested with EcoRI restriction enzyme, gel 
purified and then cloned into pPDM His, a modified pET28 vector with a His tag in 
frame, which had been digested with Eco72I and EcoRI restriction enzymes. The 
correct construct was confirmed by DNA sequence analysis and then transformed into 
BL21 pLys S (Novagen, Madison, WI) cells for expression. 

The amino acid sequence of expressed recombinant L762PA is shown in 
SEQ ID NO:430, and the DNA coding region sequence is shown in SEQ ID NO:431. 

EXAMPLE 20 

Expression in e. Cou of a L773P His Tag Fusion Protein 
PCR was performed on the L773P coding region with the following 

primers: 

Forward primer PDM-299 5' tggcagcccctcttcttcaagtggc 3' (SEQ ID 
NO:359) Tm 63°C. 

Reverse primer PDM-300 5' cgcctgctcgagtcattaatattcatcagaaaatgg 3' 
(SEQIDNO:432)TM63°C. 

The PCR conditions were as follows: 
lOul 1 OX Pfu buffer 
l.Oul lOmMdNTPs 
2.0ul lOuM each primer 
83 ul sterile water 

1.5ul Pfu DNA polymerase (Stratagene, La Jolla, CA) 
50qg DNA 

96°C for 2 minutes, 96°C for 20 seconds, 63°C for 15 seconds, 72°C for 
2 minutes 15 seconds with 40 cycles and then 72°C for 4 minutes. 

The PCR product was digested with EcoRI restriction enzyme, gel 
purified and then cloned into pPDM His, a modified pET28 vector with a His tag in 
frame, which had been digested with Eco72I and EcoRI restriction enzymes. The 
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correct construct was confirmed by DNA sequence analysis and then transformed into 
BL21 pLys S (Novagen, Madison, WI) and BL21 CodonPlus (Stratagene, La Jolla, CA) 
cells for expression. 

The amino acid sequence of expressed recombinant L773P is shown in 
SEQ ID NO:433, and the DNA coding region sequence is shown in SEQ ID NO:434. 

EXAMPLE 21 

Cloning and Sequencing of a T-Cell Receptor Clone 
for the Lung Specific Antigen L762P 

T cell receptor (TCR) alpha and beta chains from a CD4 T cell clone 
specific for the lung specific antigen L762P were cloned and sequence. Basically, total 
niRNA from 2 X 10 6 cells from CTL clone 4H6 was isolated using Trizol reagent and 
cDNA was synthesized using Ready-to go kits (Pharmacia). To determine Valpha and 
Vbeta sequences of this clone, a panel of Valpha and Vbeta subtype specific primers 
was synthesized and used in RT-PCR reactions with cDNA generated from each of the 
clones. The RT-PCR reactions demonstrated that each of the clones expressed a 
common Vbeta sequence that corresponded to the Vbeta8 subfamily and a Valpha 
sequence that corresponded to the Valpha8 subfamily. To clone the full TCR alpha and 
beta chains from clone 4H6, primers were designed that spanned the initiator and 
terminator-coding TCR nucleotides. The primers were as follows: 

forward primer for TCR Valpha8 5' 

ggatccgccgccaccatgacatccattcgagctgta 3' (SEQ ID NO:435; has a BamHI site 

inserted); 

Kozak reverse primer for TCR Valpha8 (antisense) 5' 
gtcgactcagctggaccacagccgcag 3' (SEQ ID NO:436; has a Sail site inserted plus 
the TCR alpha constant sequence); 

forward primer for TCR Vbeta8 (sense) 5' 
ggatccgccgccaccatggactcctggaccttctgct 3' (SEQ ID NO:437; has a BamHI site 
inserted); and 
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Kozak reverse primer for TCR Vbeta 5' gtcgactcagaaatcctttctcttgac 3' 
(SEQ ID NO:438; has a Sail site inserted plus the TCR beta constant sequence). 
Standard 35 cycle RT-PCR reactions were established using the cDNA synthesized 
from the CTL clone and the above primers utilizing the proofreading thermostable 
polymerase, PWO (Roche). The resultant PCR band, about 850 bp for Valpha and 
about 950 for Vbeta, was ligated into a PCR blunt vector (Invitrogen) and transformed 
into E. coli. E. coli transformed with plasmids having full-length alpha and beta chains 
were identified.. Large scale preparations of the corresponding plasmids were 
generated, and these plasmids were sequenced. The Valpha sequence (SEQ ID 
NO:439) was shown by nucleotide sequence alignment to be homologous to Valpha8.1, 
while the Vbeta sequence (SEQ ID NO:440) was shown by nucleotide sequence 
alignment to be homologous to Vbeta8.2. 

EXAMPLE 22 

Recombinant Expression of Full Length L762P in Mammalian Cells 

Full length L762P cDNA was subcloned into the mammalian expression 
vectors VR1012 and pCEP4 (Invitrogen). Both expression vectors had previously been 
modified to contain a FLAG epitope tag. These constructs were transfected into 
HEK293 and CHL-1 cells (ATCC) using Lipofectamine 2000 reagent (Gibco). Briefly, 
both the HEK and CHL-1 cells were plated at a density of 100,000 cells/ml in DMEM 
(Gibco) containing 10% FBS (Hyclone) and grown overnight. The following day, 4ul 
of Lipofectamine 2000 was added to lOOul of DMEM containing no FBS and incubated 
for 5 minutes at room temperature. The Lipofectamine/DMEM mixture was then added 
to lug of L762P Flag/pCEP4 or L762P Flag/VR1012 plasmid DNA resuspended in 
lOOul DMEM and incubated for 15 minutes at room temperature. The 
Lipofectamine/DNA mix was then added to the HEK293 and CHL-1 cells and 
incubated for 48-72 hours at 37°C with 7% C0 2 . Cells were rinsed with PBS, then 
collected and pelleted by centrifugation. L672P expression was detected in the 
transfected HEK293 and CHL-1 cell lysates by Western blot analysis and was detected 
on the surface of transfected HEK cells by flow cytometry analysis. 
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For Western blot analysis, whole cell lysates were generated by 
incubating the cells in Triton-XlOO containing lysis buffer for 30 minutes on ice. 
Lysates were then cleared by centrifugation at 10,000 rpm for 5 minutes at 4°C. 
Samples were diluted with SDS-PAGE loading buffer containing beta-mercaptoethanol, 
then boiled for 10 minutes prior to loading the SDS-PAGE gel. The protein was 
transferred to nitrocellulose and probed using 1 ng/ml purified anti-L762P rabbit 
polyclonal sera (lot #690/73) or non-diluted anti-L762P mAb 153.20.1 supernatant. 
Blots were revealed using either goat anti-rabbit Ig coupled to HRP or goat anti-mouse 
Ig coupled to HRP followed by incubation in ECL substrate. 

For flow cytometric analysis, cells were washed further with ice cold 
staining buffer (PBS+1%BSA +Azide). Next, the cells were incubated for 30 minutes 
on ice with lOug/ml of purified anti-L762P polyclonal sera (lot #690/73) or a 1:2 
dilution of anti-L762P mAb 153.20.1 supernatant. The cells were washed 3 times with 
staining buffer and then incubated with a 1:100 dilution of goat anti-rabbit Ig(H+L)- 
FITC or goat anti-mouse Ig(H+L)-FITC reagent (Southern Biotechnology) for 30 
minutes on ice. After 3 washes, the cells were resuspended in staining buffer containing 
propidium iodide (PI), a vital stain that allows for the exclusion of permeable cells, and 
analyzed by flow cytometry. 

EXAMPLE 23 

Generation of Polyclonal Antibodies to Lung Tumor Antigens 

Three lung antigens, L523S (SEQ ID NO:176), L763P (SEQ ID NO:159) 
and L763 peptide #2684 (SEQ ID NO:441), were expressed and purified for use in 
antibody generation. 

L523S and L763P were expressed in an E. coli recombinant expression 
system and grown overnight in LB Broth with the appropriate antibiotics at 37°C in a 
shaking incubator. The next morning, 10 ml of the overnight culture was added to 500 
ml of 2x YT with the appropriate antibiotics in a 2L-baffled Erlenmeyer flask. When 
the optical density of the culture reached 0.4-0.6 at 560 nanometers, the cells were 
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induced with IPTG (1 mM). Four hours after induction with IPTG, the cells were 
harvested by centrifugation. 

The cells were then washed with phosphate buffered saline and 
centrifuged again. The supernatant was discarded and the cells were either frozen for 
future use or immediately processed. Twenty milliliters of lysis buffer was added to the 
cell pellets and vortexed. To break open the E. coli cells, this mixture was then run 
through a french press at a pressure of 16,000 psi. The cells were then centrifuged again 
and the supernatant and pellet were checked by SDS-PAGE for the partitioning of the 
recombinant protein. 

For proteins that localized to the cell pellet, the pellet was resuspended in 
10 mM Tris pH 8.0, 1% CHAPS and the inclusion body pellet was washed and 
centrifuged again. This procedure was repeated twice more. The washed inclusion body 
pellet was solubilized with either 8M urea or 6M guanidine HC1 containing 10 mM Tris 
pH 8.0 plus 10 mM imidazole. The solubilized protein was added to 5 ml of nickel- 
chelate resin (Qiagen) and incubated for 45 minutes to 1 hour at room temperature with 
continuous agitation. 

After incubation, the resin and protein mixture was poured through a 
disposable column and the flow through was collected. The column was then washed 
with 10-20 column volumes of the solubilization buffer. The antigen was then eluted 
from the column using 8M urea, 10 mM Tris pH 8.0 and 300 mM imidazole and 
collected in 3 ml fractions. A SDS-PAGE gel was run to determine which fractions to 
pool for further purification. 

As a final purification step, a strong anion exchange resin, in this case 
Hi-Prep Q (Biorad), was equilibrated with the appropriate buffer and the pooled 
fractions from above were loaded onto the column. Each antigen was eluted off the 
column with an increasing salt gradient. Fractions were collected as the column was 
run and another SDS-PAGE gel was run to determine which fractions from the column 
to pool. 

The pooled fractions were dialyzed against 10 mM Tris pH 8.0. The 
release criteria were purity as determined by SDS-PAGE or HPLC, concentration as 
determined by Lowry assay or Amino Acid Analysis, identity as determined by amino 



153 



WO 02/47534 



PCT7US01/47576 



terminal protein sequence, and endotoxin level was determined by the Limulus (LAL) 
assay. The proteins were then put in vials after filtration through a 0.22-micron filter 
and the antigens were frozen until needed for immunization. 

The L763 peptide #2684 was synthesized and conjugated to KLH and 
froze until needed for immunization. 

The polyclonal antisera were generated using 400 micrograms of each 
lung antigen combined with 100 micrograms of muramyldipeptide (MDP). An equal 
volume of Incomplete Freund's Adjuvant (IF A) was added and then mixed and injected 
subcutaneously (S.C.) into a rabbit. After four weeks, the rabbit was S.C. boosted with 
200 micrograms of antigen mixed with an equal volume of IF A. Thereafter the rabbit 
was I.V. boosted with 100 micrograms of antigen. The animal was bled seven days 
following each boost. The blood was then incubated at 4°C for 12-24 hours followed 
by centrifugation to generate the sera. 

The polyclonal antisera were characterized using 96 well plates coated 
with antigen and incubated with 50 microliters (typically 1 microgram/microliter) of the 
polyclonal antisera at 4°C for 20 hours. Basically, 250 microliters of BSA blocking 
buffer was added to the wells and incubated at room temperature for 2 hours. Plates 
were washed 6 times with PBS/0.1% Tween. The rabbit sera were diluted in PBS/0.1% 
Tween/0.1%BSA. 50 microliters of diluted sera was added to each well and incubated 
at room temperature for 30 minutes. The plates were washed as described above, and 
then 50 microliters of goat anti-rabbit horseradish peroxidase (HRP) at a 1:10000 
dilution was added and incubated at room temperature for 30 minutes. 

The plates were washed as described above, and 100 microliters of TMB 
Microwell Peroxidase Substrate was added to each well. Following a 15-minute 
incubation in the dark at room temperature, the colorimetric reaction was stopped with 
100 microliters of IN H 2 S0 4 and read immediately at 450 nm. All the polyclonal 
antibodies showed immunoreactivity to the appropriate antigen.Tables 7-9 show the 
antibody reactivity of rabbit antisera in serial dilution to the three lung antigens, L523S, 
L763P and L763 peptide #2684. The first column shows the antibody dilutions. The 
columns "Pre-immune sera" indicate ELISA data for two experiments using pre- 
immune sera. These results are averaged in the fourth column. The columns "anti- 
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L523S, L763P or #2684" indicate ELISA data for two experiments using sera from 
rabbits immunized as described in this Example, using the respective antigen, referred 
to as either L523S, L763P or #2684 in the tables. 



Table 7 



Antibody 
dilution 


Pre- 
immune 
sera (1) 


Pre- 
immune 
sera (2) 


Average 


Anti- 
L523S 
(1) 


Anti- 
L523S 
(2) 


Average 


1 


1000 


0.14 


0.14 


0.14 


2.36 


2.37 


2.37 


1 


2000 


0.12 


0.10 


0.11 


2.29 


2.23 


2.26 


1 


4000 


0.10 


0.09 


0.10 


2.11 


2.17 


2.14 




8000 


0.09 


0.09 


0.09 


1.98 


2.00 


1.99 


1 


16000 


0.09 


0.09 


0.09 


1.73 


1.76 


1.75 


1 


32000 


0.09 


0.09 


0.09 


1.35 


1.40 


1.37 


1 


64000 


0.09 


0.11 


0.10 


0.94 


0.98 


0.96 


1 


128000 


0.09 


0.08 


0.08 


0.61 


0.61 


0.61 


1 


256000 


0.08 


0.08 


0.08 


0.38 


0.38 


0.38 


1 


512000 


0.09 


0.08 


0.08 


0.24 


0.25 


0.25 


1 


1024000 


0.08 


0.08 


0.08 


0.17 


0.17 


0.17 


1 


2048000 


0.08 


0.08 


0.08 


0.14 


0.13 


0.13 


Table 8 


Antibody 
dilution 


Pre- 
immune 
sera (1) 


Pre- 
immune 

sera (2) 


Average 


Anti- 
L763P 
(1) 


Anti- 
L763P 
(2) 


Average 


1 


1000 


0.09 


0.11 


0.10 


1.97 


1.90 


1.93 


1 


2000 


0.07 


0.07 


0.07 


1.86 


1.84 


1.85 


1 


4000 


0.06 


0.06 


0.06 


1.82 


1.81 


1.81 


1 


8000 


0.06 


0.06 


0.06 


1.83 


1.81 


1.82 


1 


16000 


0.06 


0.05 


0.06 


1.79 


1.74 


1.76 


1 


32000 


0.06 


0.06 


0.06 


1.56 


1.51 


1.53 


1 


64000 


0.06 


0.05 


0.05 


1.35 


1.34 


1.35 


1 


128000 


0.05 


0.05 


0.05 


1.01 


0.98 


0.99 


1 


256000 


0.06 


0.05 


0.05 


0.69 


0.70 


0.70 


1 


512000 


0.06 


0.05 


0.05 


0.47 


0.44 


0.46 


1 


1024000 


0.06 


0.05 


0.06 


0.27 


0.27 


0.27 


1 


2048000 


0.05 


0.05 


0.05 


0.16 


0.15 


0.16 
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Table 9 



Antibody 


Pre- 


Pre- 


Average 


Anti- 


Anti- 


Average 


dilution 


immune 


immune 




#2684 


#2684 








sera (1) 


sera (2) 




(1) 


(2) 




1 


1000 


0.07 


TTF^ 

0.07 


007 


2 10 


2 00 


2 05 


1 


2000 


0.07 


0.06 


0.06 


1.95 


1.96 


1.95 


1 


4000 


0.06 


0.06 


0.06 


1.77 


1.82 


1.79 


1 


8000 


0.06 


0.06 


0.06 


1.79 


1.81 


1.80 


1 


16000 


0.06 


0.06 


0.06 


1.54 


1.50 


1.52 


1 


32000 


0.06 


0.06 


0.06 


1.27 


1.20 


1.24 


1 


64000 


0.06 


0.06 


0.06 


0.85 


0.82 


0.83 


0 


0.06 


0.06 


0.06 


0.06 


0.06 


0.06 



Tables 10-12 show the affinity purification of the respective antibodies 
to the three lung antigens, L523S, L763P and L763 peptide #2684. 



Table 10 



Antibody 


Affinity 


Affinity 


Average 


Affinity 


Affinity 


Average 


cone. 


pure 


pure 




pure 


pure 




(ug/ml) 


(salt 


(salt 




(acid 


(acid 






peak) 


peak) 




peak) 


peak) 




1.0 


2.38 


2.35 


2.36 


2.25 


2.31 


2.28 


0.5 


2.24 


2.22 


2.23 


2.19 


2.18 


2.18 


0.25 


2.05 


2.09 


2.07 


2.01 


2.03 


2.02 


0.13 


1.70 


1.81 


1.75 


1.74 


1.74 


1.74 


0.063 


1.44 


1.44 


1.44 


1.43 


1.38 


1.40 


0.031 


1.05 


1.05 


1.05 


0.99 


0.99 


0.99 


.0.016 


0.68 


0.67 


0.68 


0.65 


0.64 


0.64 


0.0078 


0.43 


0.42 


0.42 


0.39 


0.39 


0.39 


0.0039 


0.27 


0.26 


0.27 


0.24 


0.26 


0.25 


0.0020 


0.18 


0.20 


0.19 


0.19 


0.18 


0.19 


0.0010 


0.13 


0.14 


0.13 


0.13 


0.14 


0.13 


0.00 


0.11 


0.12 


0.11 


0.10 


0.12 


0.11 
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Table 11 



Antibody 


Affinity 


Affinity 


Average 


dilution 


pure 


pure 




1 


1000 


1.64 


1.77 


1.70 


1 


2000 


1.59 


1.76 


1.68 


1 


4000 


1.48 


1.62 


1.55 


1 


8000 


1.35 


1.43 


1.39 


1 


16000 


1.09 


1.19 


1.14 


1 


32000 


0.81 


0.89 


0.85 


1 


64000 


0.55 


0.58 


0.56 


1 


128000 


0.31 


0.35 


0.33 


1 


256000 


0.18 


0.20 


0.19 


1 


512000 


0.11 


0.12 


0.11 


1 


1024000 


0.07 


0.07 


0.07 


1 


2048000 


0.06 


0.06 


0.06 



Table 12 



Antibody 
cone. 
(Hg/ml) 


Affinity 
pure 


Affinity 
pure 


Average 


1.0 


2.00 


2.02 


2.01 


0.5 


2.01 


1.93 


1.97 


0.25 


1.84 


1.83 


1.84 


0.13 


1.80 


1.83 


1.81 


0.06 


1.39 


1.60 


1.50 


0.03 


1.33 


1.35 


1.34 


0.02 


0.94 


0.93 


0.94 


0.00 


0.06 


0.06 


0.06 



EXAMPLE 24 
Full-Length cDNA Sequence Encoding L529S 

The isolation of a partial sequence (SEQ ID NO: 106) for lung antigen 
L529S was previously provided in Example 2. This partial sequence was used as a 
query to identify potential full length cDNA and protein sequences by searching against 
publicly available databases. The predicted full-length cDNA sequence for the isolated 



157 



WO 02/47534 



PCT7US01/47576 



cloned sequence of SEQ ID NO:106 is provided in SEQ ID NO:442. The deduced 
amino acid sequence of the antigen encoded by SEQ ID NO:442 is provided in SEQ ID 
NO:443. It was previously disclosed in Example 2 that L529S shows similarity to 
connexin 26, a gap junction protein. 

EXAMPLE 25 

Expression in Megaterium of a Histidine Tag-Free L523S Fusion Protein 

PCR was performed on the L523S coding region with the following 

primers: 

Forward primer PDM-734 5' caatcaggcatgcacaacaaactgtatatcggaaac 3' 
(SEQIDNO:444)Tni63°C. 

Reverse primer PDM-735 5' cgtcaagatcttcattacttccgtcttgac 3' (SEQ ID 
NO:445) TM 60°C. 

The PCR conditions were as follows: 
lOul 1 OX Pfu buffer 
l.Oul lOmMdNTPs 
2.0ul lOuM each primer 
83 ul sterile water 

1.5ul Pfu DNA polymerase (Stratagene, La Jolla, CA) 
50qg DNA 

96°C for 2 minutes, 96°C for 20 seconds, 62°C for 15 seconds, 72°C for 
4 minute with 40 cycles and then 72°C for 4 minutes. 

The PCR product was digested with SphI and Bglll restriction enzymes, 
gel purified and then cloned into pMEG-3, which had been digested with SphI and Bglll 
restriction enzymes. The correct construct was confirmed by DNA sequence analysis 
and then transformed into Megaterium cells for expression. 

The amino acid sequence of expressed recombinant L523S is shown in 
SEQ ID NO:446, and the DNA coding region sequence is shown in SEQ ID NO:447. 
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EXAMPLE 26 

Expression in k Coli of a Histidine Tag-Free L523S Fusion Protein 

PCR was performed on the L552S coding region with the following 

primers: 

Forward primer PDM-733 5' cgtactagcatatgaacaaactgtatatcggaaac 3' 
(SEQ ID NO:448) Tm 64°C. 

Reverse primer PDM-415 5' ccatagaattcattacttccgtcttgactgagg 3' (SEQ 
IDNO:426)TM62°C. 

The PCR conditions were as follows: 
lOullOXPfu buffer 
l.Oul lOmMdNTPs 
2.0ul 10(iM each primer 
83|ul sterile water 

1 .5ul Pfu DNA polymerase (Stratagene, La Jolla,' CA) 
50ng DNA 

96°C for 2 minutes, 96°C for 20 seconds, 62°C for 15 seconds, 72°C for 
4 minute with 40 cycles and then 72 °C for 4 minutes. 

The PCR product was digested with Ndel and EcoRI restriction 
enzymes, gel purified and then cloned into pPDM, a modified pET28 vector, which had 
been digested with Ndel and EcoRI restriction enzymes. The correct construct was 
confirmed by DNA sequence analysis and then transformed into BLR pLys S and HMS 
174 pLys S cells for expression. 

The amino acid sequence of expressed recombinant L523S is shown in 
SEQ ID NO:449, and the DNA coding region sequence is shown in SEQ ID NO:450. 

EXAMPLE 27 

Epitope- Analysis of L514S and L523S-Specific Antibodies 

Peptides of candidate antigens can be used for the evaluation of antibody 
responses in both preclinical and clinical studies. These data allow one to further 
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confirm the antibody response against a certain candidate antigen. Protein-based ELISA 
with and without competitive peptides and peptide-based ELISA can be used to 
evaluate these antibody responses. Peptide ELISA is especially useful since it can 
further exclude the false positive of the antibody titer observed in protein-based ELISA 
as well as to provide the simplest assay system to test antibody responses to candidate 
antigens. In this example, data was obtained using both L514S- and L523S-peptides 
that show that individual cancer patients produce L514S- and L523S-specific 
antibodies. The L5 14S-specific antibodies recognize primarily the following epitope of 
L514S: 

aa86-110: LGKEVRDAKITPEAFEKLGFPAAKE (SED ID 

NO:451). 

This epitope is the common epitope in humans. A rabbit antibody 
specific for L514S recognizes two addition epitopes of L514S: 

(1) aa21-45: KASDGDYYTLAVPMGDVPMDGISVA (SEQ ID 
NO:452) 

(2) aal21-135: PDPJDVNLTHQLNPKVK (SED ID NO:453) 
It was further found that the SEQ ID NO:452 is common to both L514S 

isoforms, L514S-13160 and L514S-13166, whereas the other epitopes, SEQ ID NO:451 
and SEQ ID NO:453, are probably specific to the isoform, L514S-13160. 

The L523S-specific antibodies recognize primarily the following epitope 

ofL523S: 

aa 440-460: KIAPAEAPDAKVRMVIITGP (SEQ ID NO:454). 
This epitope is the common epitope in humans. A rabbit antibody 
specific for L523S recognizes two other epitopes: 

(1) aal56-175 PDGAAQQNNNPLQQPRG (SEQ ID 
NO:455) 

(2) aa326-345: RTITVKGNVETCAKAEEEIM (SED ID 
NO:456) 

In further studies, it was determined by peptide based ELISAs that eight 
additional epitopes of L523S were recognized by L523S-specific antibodies: 
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(1) aa40-59 AFVDCPDESWALKAIEALS (SEQ ID 
NO:457) 

(2) aa80-99: IRKLQIRNIPPHLQWEVLDS (SED ID 
NO:458) 

(3) aal60-179: AQQNPLQQPRGRRGLGQRGS (SEQ ID 
NO:459) 

(4) aal80-199: DVHRKENAGAAEKSITILST (SED ID 
NO:460) 

(5) aa320-339: LYNPERTITVKGNVETCAKA (SEQ ID 
NO:461) 

(6) aa340-359: EEEIMKKIRESYENDIASMN (SED ID 
NO:462) 

(7) aa370-389: LNALGLFPPTSGMPPPTSGP (SEQ ID 
NO:463) 

(8) aa3 80-399: KIAPAEAPDAKVRMVIITGP (SED ID 
NO:464) 

Out of these, six epitopes are common in both lung plural effusion fluid 
samples and in sera of lung patients. Of these six, SEQ ID NO:459 and SEQ ID 
NO:463 have no homology to other L523S-family proteins such as IGF-II mRNA- 
binding proteins 1 and 2. Accordingly, this indicates that these two peptides can be 
used as an assay system to determine the antibody response to L523S. 

EXAMPLE 28 

Generation of L523S-Specific CTL Lines Using In Vitro Whole-Gene Priming 

To determine if L523S is capable of generating a CD8 + T cell immune 
response, CTLs were generated using in vitro whole-gene priming methodologies with 
tumor antigen-vaccinia infected DC (Yee et al, The Journal of Immunology, 
157(9):4079-86, 1996), human CTL lines were derived that specifically recognize 
autologous fibroblasts transduced with the L552S tumor antigen, as determined by 
interferon-gamma ELISPOT analysis. Specifically, dendritic cells (DC) were 
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differentiated from Percoll-purified monocytes derived from PBMC of normal human 
donors by plastic adherence and growing for five days in RPMI medium containing 
10% human serum, 50 ng/ml human GM-CSF and 30 ng/ml human IL-4. Following the 
five days of culture, the DC were infected overnight with a recombinant adenovirus that 
expresses L523S at a multiplicity of infection (M.O.I) of 33, 66 and 100, and matured 
overnight by the addition of 2 ug/ml CD40 ligand. The virus was then inactivated by 
UV irradiation. In order to generate a CTL line, autologous PBMC were isolated and 
CD 8+ T cells were enriched for by the negative selection using magnetic beads 
conjugated to CD4+, CD14+, CD16+, CD19+, CD34+ and CD56+ cells. CD8+ T cells 
specific for L523S were established in round bottom 96-well plates using 10,000 L523S 
expressing DCs and 100,000 CD8+ T cells per well in RPMI supplemented with 10% 
human serum, lOng/ml of IL-6 and 5ng/ml of IL-12. The cultures were restimulated 
every 7-10 days using autologous primary fibroblasts retro virally transduced with 
L523S, and the costimulatory molecule CD80 in the presence of IL-2. The cells were 
also stimulated with IFN-gamma to upregulate MHC Class I. The media was 
supplemented with lOU/ml of IL-2 at the time of stimulation as well as on days 2 and 5 
following stimulation. Following three stimulation cycles, ten L523S specific CD8+ T 
cell lines were identified using interferon-gamma ELISPOT analysis that specifically 
produce interferon-gamma when stimulated with the L523S tumor antigen-transduced 
autologous fibroblasts, but not with a control antigen. 

One line, 6B1, was cloned using anti-CD3 and feeder cells. The clones 
were tested for specificity on L523S-transduced fibroblasts. In addition, using a panel 
of HLA-mismatched lines transduced with a vector expressing L523S and measuring 
interferon-gamma production by this CTL line in an ELISPOT assay, it was determined 
that this clone 6B1.4B8 is restricted by HLA-A0201. 

Also using transfected Cos cells, it was shown that clone 6B1.4B8 
recognizes Cos cells transfected with pcDNA3 HLA A0201/L523S in an HLA- 
restricted and antigen specific manner. 

An epitope mapping study demonstrated the clone 6B1.4B8 recognizes 
HLA-A201 LCL loaded with peptide pool 3 (a polypeptide corresponding to amino acid 
positions 33-59 of L523S. 
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A peptide pool breakdown study demonstrated that clone 6B1.4B8 
recognizes autologous B-LCL loaded with 15-mer peptides from amino acid positions 
37-55 of L523S, TGYAFVCPDESWALKAIE (SEQ ID NO:465). A further peptide 
breakdown study demonstrated that clone 6B1.4B8 recognizes T2 cells loaded with the 
same 15-mer peptides. 

A peptide recognition study demonstrated that clone 6B1.4B8 prefers T2 
cells loaded with the peptide FVDCPESWAL (SEQ ID NO.466) which is corresponds 
to the amino acid sequence at positions 41-51 of L523S and is encoded by the DNA 
sequence of SEQ ID NO:467. 

EXAMPLE 29 
L523S Expression in Other Human Cancers 

It was previously disclosed in Example 2 that L523S is expressed in lung 
cancers including squamous, adenocarcinoma and small cell carcinoma. To further 
evaluate the expression profile of this antigen an electronic express profiling was 
performed. This was done by searching a L523S-specific sequence against a public 
EST database. Results of this profiling indicate that L523S may also be present in 
colon adenocarcinomas, prostate adenocarcinomas, CML, AML, Burkitt's Lymphoma, 
brain tumors, retinoblastomas, ovarian tumors, teratocarcinomas, uterus myosarcomas, 
germ cell tumors as well as pancreatic and cervical tumor cell lines. 

EXAMPLE 30 

Immunohistochemistry Analysis of L523S 

In order to determine which tissues express the lung tumor antigen 
L523S, immunohistochemistry (IHC) analysis was performed on a diverse range of 
tissue types. Polyclonal antibodies specific for L523S (SEQ ID NO.T76) were generated 
as described in Example 23. IHC was performed essentially as described in Example 6. 
Briefly, tissue samples were fixed in formalin solution for 12-24 hours and embedded in 
paraffin before being sliced into 8 micron sections. Steam heat induced epitope 
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retrieval (SHIER) in 0.1 sodium citrate buffer (pH 6.0) was used for optimal staining 
conditions. Sections were incubated with 10% serum in PBS for 5 minutes. The 
primary L523S antibody was added to each section for 25 minutes followed by a 25 
minute incubation with anti-rabbit biotinylated antibody. Endogenous peroxidase 
activity was blocked by three 1.5 minute incubations with hydrogen peroxidase. The 
avidin biotin complex/ horse radish peroxidase (ABC/HRP) system was used along with 
DAB chromogen to visualize antigen expression. Slides were counterstained with 
hematoxylin to visualize the cell nuclei. 

IHC analysis of L523S expression revealed that of the lung cancer 
tissues tested over 90% of tissue samples demonstrated high over-expression of the lung 
tumor antigen (10/1 1 adenocaricomas and 8/9 squamous). Of the normal tissues tested, 
all were negative for expression of L523S, with the exception of weak staining in 
normal bronchus, testis, liver, and trachea. 

EXAMPLE 31 

Generation and Characterization of L762 Human Monoclonal 
Antibodies 

Cell supernatants from hybridoma fusions from the Xenomouse strain of 
transgenic mice were screened for ability to bind to L762P. All results are shown in 
Table 13. The primary screen was to test monoclonal supernatants for reactivity to 
L762P by ELISA analysis using recombinant bacterial expressed protein. We next 
tested the human supernatants for reactivity to surface expressed L762P by whole cell 
ELISA using fluorimetry analysis. Specific reactivity of the humab supernatants was 
confirmed by performing FACS analysis on cells transfected with either an irrelevant 
plasmid or a plasmid expressing L762P. FI/CFI is the relative fold increase in 
fluorescence intensity (FI) of the anti-L762P humab primary antibody to irrelevant 
human primary antibody. FI/CFI/A20 is the relative fold increase in fluorescence 
intensity (FI) of the anti-L762P humab primary antibody to irrelevant human primary 
antibody over the FI of the anti-L762P mouse monoclonal antibody 153A20.1. 
FI/CFI/R690 is the relative fold increase in fluorescence intensity (FI) of the anti-L762P 
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humab primary antibody to irrelevant human primary antibody over the FI of the anti- 
L762P rabbit polyclonal antibody. FACS VRL762 is the percentage of cells transfected 
with plasmid expressing L762P that were positive following staining with indicated 
monoclonal antibody. FACS VR(-) is the percentage of cells transfected with irrelevant 
plasmid that were positive following staining with indicated monoclonal antibody. 
ELISA is the O.D. values of the indicated monoclonal antibody to recombinant L762P 
protein. The shaded rows in Table 13 indicate those antibodies that will be further 
cloned and characterized. 



Table 13: Human Monoclonal Antibodies Against L762P 



L762PHumAb 


FI/CFI 


FI/CFI/A20 


FI/CFI/R690 


FACSVRL762 


FACS VR (-) 


ELISA 


L762/VR1013 


R-690 


4.59 




1.00 










M-A20 


2.88 


1.00 












1.176 


0.51 


0.18 


0.11 






0.38 




1.178 


1.42 


0.49 


0.31 






0.35 




1.179 


0.47 


0.16 


0.10 






0.07 




1.180 


1.50 


0.52 


0.33 






0.26 




1.182 


1.45 


0.50 


0.32 






0.26 




1.183 


0.75 


0.26 


0.16 






0.24 




1.185 


0.89 


0.31 


0.19 






0.46 




1.186 


3.45 


1.20 


0.75 ' ' % 


32.68 


7.14 %t ;• 


1.22 




1.187 


0.36 


0.13 


0.08 






0.06 




1.188 


0.26 


0.09 


0.06 






0.23 




1.189 


0.50 


0.17 


0.11 






0.44 




1.190 


0.53 


0.18 


0.12 






0.42 




1.191 


3.12 


1.08 


0.68 


41.44 


17.90 


0.86 


1.29 


1.192 


1.91 


0.66 


0.42 






0.12 




1.193 


2.87 


1.00 


0.63 


17.82 


6.43 


0.13 


1.06 . 


1.194 


1.55 


0.54 


0.34 






0.28 




1.195 


0.14 


0.05 


0.03 






0.37 




1.196 


1.97 


0.68 


0.43 






0.89 


1.64 


1.197 


0.43 


0.15 


0.09 






0.08 




1.198 


0.54 


0.19 


0.12 






0.33 




1.199 


0.70 


0.24 


0.15 






0.40 




1.200 


2.00 


0.69 


0.44 






0.38 


1.56 


1.201 


1.62 


0.56 


0.35 






0.29 




1.202 


0.86 


0.30 


0.19 






0.36 




1.203 


1.56 


0.27 


0.18 






0.14 




1.204 


3.32 


0.58 


0.38 


24.83 


6.60 


0.17 


1.91 


1.205 


2.13 


0.37 


0.25 






0.09 




1.206 


0.45 


0.08 


0.05 






0.23 




1.207 


0.60 


0.10 


0.07 






0.39 




1.208 


0.12 


0.02 


0.01 






0.36 




1.209 


15.52 , 


2.71 


1,80 


27.54 


9.54 


0.16 


0.77 


1.210 


0.92 


0.16 


0.11 






0.16 




1.211 


2.83 


0.49 


0.33 






0.42 




1.212 


3.40 


0.59 


0.39 


21.68 


11.36 


0.14 


2.47- 


1.213 


2.32 


0.40 


0.27 






0.38 




1.214 


0.80 


0.14 


0.09 






0.34 




1.215 


3.96 


0.69 


0.46 


38.87 '• 


13.17 


0.33 


- 1.80 


1.216 


1.26 


0.22 


0.15 






0.20 




1.217 


1.99 


0.35 


0.23 






0.26 
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L762PHumAb 


FI/CFI 


FI/CFI/A20 


FI/CFI/R690 


FACSVRL762 


FACSVR(-) 


ELISA 


L762/VR1013 


1.218 


2.29 


0.40 


0.27 






0.10 




1.219 


0.15 


0.03 


0.02 






0.06 




1.220 


0.82 


0.14 


0.09 






0.21 




1.221 


2.29 


0.40 


0.27 






0.12 




1.222 


0.57 


0.10 


0.07 






0.45 




1.223 


0.11 


0.02 


0.01 






0.11 




1.224 


2.08 


0.36 


0.24 






0.25 




1.225 


0.95 


0.17 


0.11 






0.22 




1.226 


-0.32 


-0.06 


-0.04 






0.06 




R-690 


8.62 




1.00 


72.34 


39.83 






M-A20 


5.73 


1.00 




50.23 


6.34 






M-A12 






67.43 


25.15 








M-Irr 






7.74 


7.35 








R-Irr 






30.09 


24.80 








H-Irr 






25.52 


39.14 
























R-690 


3.20 




1.00 










M-A20 


2.33 


1.00 












1.250 


0.15 


0.06 


0.05 






0.28 




1.228 


0.38 


0.16 


0.12 






0.08 




1.229 


0.39 


0.17 


0.12 






0.44 




1.230 


1.78 


0.76 


0.56 






0.13 


1.35 


1.231 


0.42 


0.18 


0.13 






0.47 




1.232 


0.34 


0.15 


0.11 






0.25 




1.233 


7.07 


3.04 
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5.33 


1.00 












1.97 


1.47 


0.28 


0.19 






0.37 




1.98 


3.69 


0.69 


0.49 


38.67 


16.57 


0.43 


1.69 


1.99 


4.32 


0.81 


0.57 . 


38.31 


18.76 


0.40 


1.48 


1.100 


0.22 


0.04 


0.03 






0.32 




1.101 


2.06 


0.39 


0.27 






0.49 




1.102 


0.23 


0.04 


0.03 






0.12 




1.103 


0.33 


0.06 


0.04 






0.28 




1.104 


0.45 


0.08 


0.06 






0.08 




1.105 


4.19 


0.79 


0.55 


37.19 


12.41 


0.25 


2.18 ; 


1.106 


4.22 


0.79 


0.56 


46.24 


30.59 


1.21 


1.58 


1.107 


0.15 


0.03 


0.02 






0.06 




1.108 


0.08 


0.01 


0.01 






0.31 




1.109 


2.70 


0.51 


0.36 


6.5 


6 


0.07 




1.110 


1.02 


0.19 


0.13 






0.35 




1.111 


2.55 


0.48 


0.34 






0.10 




1.112 


3.58 


0.67 


0.47 


18.6 


4.2 


1.25 . . 


1.74 


1.113 


0.37 


0.07 


0.05 






0.35 




1.114 


-0.06 


-0.01 


-0.01 






0.27 




1.115 


0.55 


0.10 


0.07 






0.13 




1.116 


2.24 


0.42 


0.30 






0.44 




1.117 


0.56 


0.10 


0.07 






0.27 




1.118 


0.77 


0.14 


0.10 






0.43 




1.119 


0.78 


0.15 


0.10 






0.41 




1.120 


0.73 


0.14 


0.10 






0.58 




1.121 


0.21 


0.05 


0.03 






0.40 




1.122 


0.11 


0.03 


0.02 






0.29 




1.123 


0.41 


0.11 


0.07 






0.07 




1.124 


3.66 


0,95 


0.61 


41.27 - 


34.83 


0.28 


1.85 
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L762PHumAb 


FI/CFI 


FI/CFI/A20 


FI/CFI/R690 


FACSVRL762 


FACSVR(-) 


ELISA 


L762/VR1013 


1.125 


2.67 


0.69 


0.44 






0.27 


1.55 


1.126 


2.36 


0.61 


0.39 






0.86 


1.71 


1.127 


0.70 


0.18 


0.12 






0.11 




1.128 


2.99 


0.77 


0.50 






0.13 


1.45 


1.129 


0.33 


0.09 


0.06 






0.39 




1.130 


0.40 


0.10 


0.07 






0.18 




1.131 


1.45 


0.38 


0.24 






0.52 




1.132 


0.33 


0.08 


0.05 






0.25 




1.133 


0.17 


0.04 


0.03 






0.24 




1.134 


0.86 


0.22 


0.14 






0.15 




1.135 


1.75 


0.45 


0.29 






0.30 




1.136 


1.35 


0.35 


0.23 , 






0.07 




1.137 


2.30 


0.59 


0.38 






0.83 


1.30 


1.138 


0.83 


0.21 


0.14 






0.60 




1.139 


1.57 


0.41 


0.26 






0.55 




1.140 


1.40 


0.36 


0.23 






1.28 




1.142 


-0.10 


-0.03 


-0.02 






0.26 




1.143 


1.46 


0.38 


0.24 






0.16 




1.144 


2.41 


0.62 


0.40 






0.76 




R-690 


6.00 




1.00 










M-A20 


3.86 


1.00 




56.4 


5 






















R-690 


2.58 


3.22 


1.00 










M-A20 


0.80 


1.00 












1.145 


0.23 


0.29 


0.09 






0.18 




1.146 


-0.12 


-0.15 


-0.05 






0.41 




1.147 


0.14 


0.18 


0.06 






0.31 




1.148 


0.09 


0.11 


0.03 






0.43 




1.149 


0.39 


0.49 


0.15 






0.37 




1.150 


2.23 


2.79 


0.87 . 


17.3 


5,4 


0.70 


1.46 


1.151 


0.13 


0.16 


0.05 






0.29 




1.152 


0.55 


0.69 


0.21 






0.33 




1.154 


-0.20 


-0.25 


-0.08 






0.41 




1.155 


0.16 


0.19 


0.06 






0.23 




1.156 


0.06 


0.07 


0.02 






0.31 




1.158 


0.54 


0.67 


0.21 






0.58 




1.159 


0.78 


0.98 


0.30 






0.09 




1.160 


0.23 


0.29 


0.09 






0.08 




1.162 


0.63 


0.78 


0.24 






0.11 




1.163 


0.20 


0.25 


0.08 






0.10 




1.164 


0.22 


0.27 


0.08 






0.09 




1.166 


1.41 


1.76 


0.55 


22.9 


5.3 


0.52 , 


2.41, 


1.167 


0.32 


0.40 


0.12 






0.08 




1.168 


0.88 


1. 10 


0,34 


15.9 - 


5.1 '. : 


0,48 


1.90 


1.170 


0.22 


0.42 


0.11 






0.21 




1.171 


0.40 


0.76 


0.19 






0.38 




1.172 


0.09 1 


0.17 


0.04 






0.12 




1.174 


0.23 


0.43 


0.11 






0.15 




1.175 


0.14 


0.26 


0.07 






0.20 




R-690 


2.06 


3.91 


1.00 










M-A20 


0.53 


1.00 




56.4 


5 






for 1.170 to 1.175 


Fl-fluorescence intensity of primary antibody 


CFI-fluorescence intensity of human irrelevant primary antibody. 


A20-mouse ar 


ti-L762P monoclonal antibody 








R690-rabbit anti-L762P affinity purified polyclonal antibody 


FACS VRL762-percent positive cells from transient transfection of VR1013/L762 expression 


plasmid 




FACS VR(-)-percent positive cells from tr 


msient transfectic 


n of empty VR1013 expression plasmid 
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EXAMPLE 32 

Epitope Mapping and Purification of hL523S-specific Antibodies 

This Example describes the purification of L523S antibodies that can 
distinguish between human and mouse L523S homologs and will likely distinguish 
between hL523S and hL523S-family members such as hIMP-1 and hIMP-2. 

L523S (mil-length cDNA and amino acid sequence set forth in SEQ ID 
NO:347 and 348, respectively) is one of a family of proteins that includes hIMP-1 and 
hIMP-2. The members of this family of proteins have a high degree of similarity one to 
the other and are also highly similar between species. Thus, generating antibodies that 
specifically recognize human L523S (hL523S) and not other members of the protein 
family in humans or the mouse homologs, has been problematic. However, in order to 
evaluate preclinical and clinical L523S DNA/Adeno viral vaccines by detecting the 
protein expression of L523S, human L523S-specific antibodies are critical. 

Polyclonal antibodies specific for hL523S were generated as described in 
Example 23. These antibodies were used to map epitopes. The epitope analysis 
showed 2 particular peptides of hL523S that were recognized, peptide 16/17 and peptide 
32. 

The amino acid sequences of both hL523S and mouse L523S (mL523S) 
peptide 16/17 and peptide 32 were then compared. Peptide 32/33 is identical between 
hL523S and mL523S. However, as the alignment below indicates, peptide 16/17 has 5 
amino acid differences between the human and mouse homologs (underlined). 



hL523S (16/17) (SEQ ID NO:468): 

I P DEMAAQQN PLQQPRGRRGLGQR 

mL523S (16/17) (SEQ ID NO:469): 

I P DETAAQQN P SPQLRGRRGPGQR 



Moreover, peptide-based ELISAs showed that peptide 17 is specifically 
recognized by lung cancer patient sera #197, and a homology search of peptide 17 
between human IMP (hIMP) family members shows that there is little similarity in this 
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region between family members. The hL523S peptide 17 (and 16/17) has less than 50% 
similarity to hL523S family members such as hIMP-1 and hIMP-2. 

Based upon the epitope mapping of L523S-specific antibodies and the 
data from the homology search, hL523S or mL523S peptide 16/17-conjugated ligands 
were then used to purify human or mouse L523S-specific antibodies from rabbit 
polyclonal antibodies generated against hL523S protein as described in Example 23. 
The data from the antibodies purified by affinity chromatography using ligands 
conjugated with either hL523S-peptide 16/17 or mL523S-peptide 16/17 suggested that 
the affinity of antibodies specific to hL523S-peptide 16/17 is much higher than that of 
antibodies to mL523S-peptide 16/17 since they bind more strongly to hL523S-peptide 
16/17 than to mL523S-peptide 16/17. The difference in affinity between the purified 
antibodies to human and mouse L523S-peptide 16/17 was confirmed by peptide-based 
ELISA. The antibodies purified by hL523S-peptide 16/17 selectively bind to human 
L523S-peptide 16/17 but bind much less or not at all to mL523S-peptide 16/17. 

In order to further characterize the original polyclonal antibodies and 
antibodies purified by hL523S-peptide 16/17, immunoblot analysis was conducted using 
both human lung adenocarcinoma line as a source of hL523S protein and mouse whole 
body embryo (day 17 gestation) as the source of mL523S protein. This analysis showed 
that polyclonal antibodies specific for hL523S recognize hL523S protein expressed in 
the tumor cell line as well as mL523S protein expressed in whole body embryos of day 
17 gestation. However, the addition of hL523S peptide 32/33 blocks binding of 
antibodies to human and mouse L523S proteins. Thus, the crossreactivity of the 
polyclonal antibodies to mL523S protein is due to the existence of antibodies specific to 
hL523S peptide 32/33. In marked contrast, the purified antibodies specific to hL523S 
peptide 16/17 do not bind mL523S protein expressed in mice embryos but do recognize 
hL523S protein expressed in human lung adenocarcinoma cells. These data confirm the 
ELISA data using hL523S-peptide 16/17 and mL523S-peptide 16/17 described above. 

The amino acid sequence of hL523S peptide 16/17 used to purify the 
antibodies is about 60-70% similar to that of the mL523S-peptide 16/17 which is not 
recognized by hL523S-specific antibodies by Western blot analysis and peptide-based 
ELISA. The hL523S peptide 16/17 has less than 50% similarity to hL523S family 
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members such as hIMP-1 and hIMP-2. Taken together, these data suggest that it is 
highly probable that the antibodies purified by hL523S peptide 16/17 described herein 
will also distinguish hL523S protein from the other hL523S family members. 

In summary, antibodies purified with the hL523S peptide 16/17 do not 
recognize the mouse L523S homolog. The amino acid sequence of peptide 16/17 
between hL523S family members is less similar than between human and mouse 
L523S. Thus, the hL523S-specific antibodies described above can be used to 
distinguish between human and mouse L523S and between members of the hL523S 
family of proteins and can therefore be used for the accurate detection of hL523S 
protein expression in animals and humans. 

EXAMPLE 33 
In Vivo Immunogenecity of Lung Tumor Antigen l523 
This example describes two in vivo immunogenicity studies to evaluate 
the vaccination of mice with either an adenovirus containing L523 or with L523 naked 
DNA followed by a second immunization with an adenovirus containing L523. 

The first study involved the immunization of two strains of mice with 
L523 adenovirus. The C57B16 strain of mice is homozygous for HLA-type H-2 b , while 
strain B6D2(F1) is heterozygous for the HLA-type, H-2 b/d . Table 14 describes the 
initial immunization strategy employed. 



Table 14: Immunization with L523 Adenovirus alone: Experimental 

Design 



Group 


Immunization 


Strain (4/group) 


1 


10 s PFU Ad L523 A 


C57BL6 


2 


10 7 PFU Ad hrGFP A 


C57BL6 


3 


10 8 PFU Ad L523 A 


B6D2(F1) 


4 


10 7 PFU Ad hrGFP A 


B6D2(F1) 


5 


Naive 


C57BL6 


6 


Naive 


B6D2(F1) 



PFU=plaque forming unit; GFP=green fluorescent protein; Ad=adenovirus. 

174 



WO 02/47534 



PCT7US01/47576 



Mice were immunized intradermally with either 10 8 PFU of L523- 
adenovirus or 10 7 PFU of an irrelevant adenovirus (hrGFP). Three weeks following 
immunization, IgGl and IgG2a antibody responses to L523 were examined in all groups 
of mice. Briefly, recombinant full length L523 (rL523) was coated onto ELISA plates 
and serum, at multiple dilutions, was added to the wells. Following a 60-minute 
incubation, the serum was washed from the wells and a secondary antibody, either 
specific for an IgGl or IgG2a was added to the plates. Both antibodies were directly 
conjugated to horseradish peroxide (HRP). The levels of L523 antibodies, either IgGl 
or IgG2a, were measured in all groups. In the C57BL6 mice, little to no L523-specific 
antibodies were detected following immunization. However, in the B6D2(F1) strain of 
mice immunized with L523 adenovirus, both IgGl and IgG2a L523-specific antibodies 
were detected at serum dilution as low as 1/1000. 

In addition to detecting L523 -specific antibodies in the serum, 
interferon-gamma (IFN-y) responses were assayed from immune spleen cells following 
in vitro stimulation with rL523 protein. Briefly, spleen cells were harvested from all 
mice groups and cultured for 3 days in 96-well plates. Culture conditions included, 
media alone, 1 or lOug/ml of rL523 protein, or 5ug/ml of concanavalin A (Con A). 
After 3 days, the supernatants were harvested and assayed for IFN-y levels in the 
supernatants. 

Immunization with L523 -adenovirus, but not an irrelevant adenovirus, 
elicited a strong IFN-y response from the spleen cells which were stimulated with 
rL523. In general, responses were stronger in the B6D2(F1) mouse strain, as evidenced 
by both a higher level of IFN-y production, as well as the fact that stimulation with a 
lower antigen concentration (lug/ml) elicited an equally strong response as seen with 
the higher antigen concentration (10|jg/mi). 

Finally, T cell proliferation responses were assayed from immune spleen 
cells by stimulation in vitro with rL523 protein. Briefly, spleen cells were cultured for 4 
days in 96-well plates with, media alone, 1 or lOug/ml of rL523 protein, or Con A. The 
cultures were then pulsed with 3H-thymidine for the final 8 hours of culture. Results 
are represented as the stimulation index (SI) in the presence of antigen relative to 
stimulation with media alone. Results were consistent with those obtained in the IFN-y 
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assay. Immunization with L523 -adenovirus, but not an irrelevant adenovirus, elicited a 
proliferation response in spleen cells stimulated with rL523. A strong SI (average of 
>20) was observed in spleen cells harvested from the B6D2(F1) mouse strain, with 
similar levels of proliferation observed at both protein concentrations. Little or no T 
cell proliferation was observed in the C57BL6 mouse strain. 

A second study involved the immunization of two strains of mice 
initially with L523 naked DNA followed by a second immunization with L523 
adenovirus two weeks later. The mice were harvested 3 weeks after the boost. Table 
15 describes the immunization regimen of the second study. 



Table 15: Immunization with L523 DNA followed by a second 
immunization with L523-Adenovirus: Experimental Design 



Group 


Immunization 


Strain (4/group) 


1 


L523 DNA +10 8 PFU Ad L523 A 


C57BL6 


2 


10 8 PFUAdL523A 


C57BL6 


3 


Irrelevant DNA + 10 7 PFU Ad hrGFP A 


C57BL6 


4 


10 7 PFU Ad hrGFP A 


C57BL6 


5 


Naive 


C57BL6 


6 


L523 DNA +10 8 PFU Ad L523 A 


B6D2(F1) 


7 


10 8 PFUAdL523A 


B6D2(F1) 


8 


Irrelevant DNA + 10 7 PFU Ad hrGFP A 


B6D2(F1) 


9 


10 7 PFU Ad hrGFP A 


B6D2(F1) 


10 


Naive 


B6D2(F1) 



PFU=plaque forming unit; GFP=green fluorescent protein; Ad=adenovirus. 

As described in the first study, strong IgGl and IgG2a antibody 



responses were observed in B6D2(F1) mice following immunization with L523- 
adenovirus. Immunizing with L523 DNA appeared to increase the overall L523- 
specific antibody response compared to responses achieved with immunization with 
L523 -adenovirus alone. C57BL6 mice elicited little or no L523-specific antibody 
responses following immunization with L523-adenovirus, but were some slightly 
positive responses were detected in mice immunized with L523 DNA followed by a 
second immunization with L523 -adenovirus. 
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IFN-y responses were assayed from immune spleen cells by stimulation 
in vitro with rL523 protein. These results confirm those observed in the initial study 
demonstrating the immunogenecity of L523 in animals. The results also suggest that 
initially immunizing the animals with L523 DNA, prior to immunization with L523- 
adeonvirus, does not significantly increase the CD4 response. As with the initial study, 
responses appear to be stronger in the B6D2(F1) strain of mice than the C57BL6 strain. 

As with the initial study, T cell proliferation responses were assayed 
from immune spleen cells by stimulation in vitro with rL523 protein. The results from 
using two rounds of immunization are consistent with those obtained from the first 
study, rmmunization with L523 DNA prior to a second round of immunization with 
L523-adenovirus did not significantly increase the proliferation responses generated in 
the mice. As with the first study, responses were stronger in the B6D2(F1) mouse strain 
than in the C57BL6 strain. 

The difference in HLA types between the two strains of mice could 
explain variations in the extent of the immune responses detected. As described above, 
the C57BL6 strain is homozygous for H-2 b , while the B6D2(F1) is heterozygous for H- 
2 b/d . The increased diversity of the B6D2(F1) strains HLA type allows for a greater 
number of epitopes derived from the L523 protein to be presented. In this strain, 
epitopes specific for both H-2 b and H-2 d can be presented, while only H-2 b epitopes can 
be presented by the C57BL6 strain. 

From the foregoing it will be appreciated that, although specific 
embodiments of the invention have been described herein for purposes of illustration, 
various modifications may be made without deviating from the spirit and scope of the 
invention. Accordingly, the invention is not limited except as by the appended claims. 
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CLAIMS 



What is Claimed : 



1. 



A method for inducing an immune response in an animal, 



comprising: 



a) 



providing a composition comprising a polynucleotide encoding at 



least an immunogenic portion of a lung carcinoma polynucleotide wherein the 
polynucleotide has at least 90%identity with SEQ ID NO:347; 



2. The method of claim 1 , wherein said composition further 
comprises a component selected from the group consisting of a physiologically 
acceptable carrier or an adjuvant. 

3. A method according to claim 1, wherein the lung carcinoma 
polynucleotide is delivered by a viral based delivery system. 

4. A method according to claim 3, wherein the viral based delivery 
system is an adenovirus. 

5. The method of claim 1 , wherein the immune response induced is 
a CD4+ T helper response. 

6. The method of claim 1 , wherein the immune response induced is 
a CD8+ cytotoxic T lymphocyte response. 

7. The method of claim 1, wherein the immune response induced is 
both a CD4+ T helper and CD8+ cytotoxic T cell immune response. 



b) 
c) 



administering said polynucleotide; and 

thereby inducing an immune response in an animal. 



178 



WO 02/47534 



PCT7US01/47576 



8. An isolated polynucleotide comprising a sequence selected from 
the group consisting of: 

(a) sequences provided in SEQ ID NO:35 1 , 353, 358, 362, 364, 366, 
368, 370-375, 420, 424, 428, 431, 434, 442, 447, 450 and 467; 

(b) complements of the sequences provided in SEQ ID NO:351, 353, 
358, 362, 364, 366, 368, 370-375, 420, 424, 428, 431, 434, 442, 447, 450 and 467; 

(c) sequences consisting of at least 10 contiguous residues of a 
sequence provided in SEQ ID NO:351, 353, 358, 362, 364, 366, 368, 370-375, 420, 
424, 428, 431, 434, 442, 447, 450 and 467; 

(d) sequences that hybridize to a sequence provided in SEQ ID 
NO:351, 353, 358, 362, 364, 366, 368, 370-375, 420, 424, 428, 431, 434, 442, 447, 450 
and 467, under highly stringent conditions; 

(e) sequences having at least 75% identity to a sequence of SEQ ID 
NO:351, 353, 358, 362, 364, 366, 368, 370-375, 420, 424, 428, 431, 434, 442, 447, 450 
and 467; 

(f) sequences having at least 90% identity to a sequence of SEQ ID 
NO:351, 353, 358, 362, 364, 366, 368, 370-375, 420, 424, 428, 431, 434, 442, 447, 450 
and 467; and 

(g) degenerate variants of a sequence provided in SEQ ID NO:351, 
353, 358, 362, 364, 366, 368, 370-375, 420, 424, 428, 431, 434, 442, 447, 450 and 467. 

9. An isolated polypeptide comprising an amino acid sequence 
selected from the group consisting of: 

(a) sequences having at least 90% identity to a polypeptide having an 
amino acid sequence of any one of the sequences provided in SEQ ID NO:352, 354, 
357, 361, 363, 365, 367, 369, 376-382, 387-419, 423, 427, 430, 433, 441, 443, 446, 
449, 451-466 and 468-469; 

(b) sequences encoded by a polynucleotide of claim 8 ; 

(c) sequences having at least 70% identity to a sequence encoded by 
a polynucleotide of claim 8; and 
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(d) sequences having at least 90% identity to a sequence encoded by 
a polynucleotide of claim 8. 

10. An expression vector comprising a polynucleotide of claim 8 
operably linked to an expression control sequence. 

11. A host cell transformed or transfected with an expression vector 
according to claim 10. 

12. An isolated antibody, or antigen-binding fragment thereof, that 
specifically binds to a polypeptide of claim 9. 

13. A method for detecting the presence of a cancer in a patient, 
comprising the steps of: 

(a) obtaining a biological sample from the patient; 

(b) contacting the biological sample with a binding agent that binds 
to a polypeptide of claim 9; 

(c) detecting in the sample an amount of polypeptide that binds to 
the binding agent; and 

(d) comparing the amount of polypeptide to a predetermined cut-off 
value and therefrom determining the presence of a cancer in the patient. 

14. A fusion protein comprising at least one polypeptide according to 

claim 9. 



15. A fusion protein according to claim 14, wherein the fusion 
protein is selected from the group consisting sequences provided in SEQ ID NO:352, 
354, 423,427, 430 and 433. 
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16. An oligonucleotide that hybridizes to a sequence recited in SEQ 
ID NO:351, 353, 358, 362, 364, 366, 368, 370-375, 420, 424, 428, 431, 434, 442, 447, 
450 and 467 under highly stringent conditions. 



17. A method for stimulating and/or expanding T cells specific for a 
tumor protein, comprising contacting T cells with at least one component selected from 
the group consisting of: 

(a) polypeptides according to claim 9; 

(b) polynucleotides according to claim 8; and 

(c) antigen-presenting cells that express a polynucleotide according 

to claim 8, 

under conditions and for a time sufficient to permit the stimulation 
and/or expansion of T cells. 

18. An isolated T cell population, comprising T cells prepared 
according to the method of claim 17. 

19. A composition comprising a first component selected from the 
group consisting of physiologically acceptable carriers and immunostimulants, and a 
second component selected from the group consisting of: 

(a) polypeptides according to claim 9; 

(b) polynucleotides according to claim 8; 

(c) antibodies according to claim 12; 

(d) fusion proteins according to claim 14; 

(e) T cell populations according to claim 18; and 

(f) antigen presenting cells that express a polypeptide according to 

claim 9. 



20. A method for stimulating an immune response in a patient, 
comprising administering to the patient a composition of claim 19. 
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21. A method for the treatment of a lung cancer in a patient, 
comprising administering to the patient a composition of claim 19. 

22. A method for determining the presence of a cancer in a patient, 
comprising the steps of: 

(a) obtaining a biological sample from the patient; 

(b) contacting the biological sample with an oligonucleotide 
according to claim 9; 

(c) detecting in the sample an amount of a polynucleotide that 
hybridizes to the oligonucleotide; and 

(d) compare the amount of polynucleotide that hybridizes to the 
oligonucleotide to a predetermined cut-off value, and therefrom determining the 
presence of the cancer in the patient. 

23. A diagnostic kit comprising at least one oligonucleotide 
according to claim 16. 

24. A diagnostic kit comprising at least one antibody according to 
claim 12 and a detection reagent, wherein the detection reagent comprises a reporter 
group. 

25. A method for the treatment of lung cancer in a patient, 
comprising the steps of: 

(a) incubating CD4+ and/or CD8+ T cells isolated from a patient 
with at least one component selected from the group consisting of: (i) polypeptides 
according to claim 9; (ii) polynucleotides according to claim 8; and (iii) antigen 
presenting cells that express a polypeptide of claim 9, such that T cell proliferate; 

(b) administering to the patient an effective amount of the 
proliferated T cells, 

and thereby inhibiting the development of a cancer in the patient. 
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SEQUENCE LISTING 

<110> Corixa Corporation 
Wang, Tongtong 
Wang, Aijun 
Skeiky, Yasir A.W. 
Li, Samual X. 
Kalos, Michael D. 
Henderson, Robert A. 
McNeill, Patricia D. 
Fanger, Neil 
Retter, Marc W. 
Durham, Margarita 
Fanger, Gary R. 
Vedvick, Thomas S. 
Carter, Darrick 
Watanabe, Yoshihiro 
Peckman, David W. 
Cai, Feng 
Foy, Teresa M. 



<120> COMPOSITIONS AND METHODS FOR THE THERAPY 
AND DIAGNOSIS OF LUNG CANCER 



<130> 210121. 45503PC 



<140> PCT 

<141> 2001-11-30 



<160> 469 



<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 315 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<222> 236, 241 
<223> n = A,T,C or G 



<400> 1 

gcagagacag actggtggtt gaacctggag gtgccaaaaa agccagctgc gggcccagga 60 
cagctgccgt gagactcccg atgtcacagg cagtctgtgt ggttacagcg cccctcagtg 12 0 
ttcatctcca gcagagacaa cggaggaggc tcccaccagg acggttctca ttatttatat 180 
gttaatatgt ttgtaaactc atgtacagtt ttttttgggg gggaagcaat gggaanggta 240 
naaattacaa atagaatcat ttgctgtaat ccttaaatgg caaacggtca ggccacgtga 300 
aaaaaaaaaa aaaaa 315 



<210> 2 
<211> 380 
<212> DNA 
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<213> Homo sapiens 
<400> 2 

atttaggctt aagattttgt ttacccttgt tactaaggag caaattagta ttaaagtata 60 
atatatataa acaaatacaa aaagttttga gtggttcagc ttttttattt tttttaatgg 120 
cataactttt aacaacactg ctctgtaatg ggttgaactg tggtactcag actgagataa 180 
ctgaaatgag tggatgtata gtgttattgc ataattatcc cactatgaag caaagggact 240 
ggataaattc ccagtctaga ttattagcct ttgttaacca tcaagcacct agaagaagaa 300 
ttattggaaa ttttgtcctc tgtaactggc actttggggt gtgacttatc ttttgccttt 3 60 
gtaaaaaaaa aaaaaaaaaa 380 

<210> 3 
<211> 346 
<212> DNA 

<213> Homo sapiens 

<220> i 
<221> misc_feature 

<222> 316, 317, 318, 322, 323, 326, 329, 330, 331, 336, 337, 339, 

340, 342, 343 

<223> n = A,T,C or G 

<400> 3 

ttgtaagtat acaattttag aaaggattaa atgttattga tcattttact gaatactgca 60 
catcctcacc atacaccatc cactttccaa taacatttaa tcctttctaa aattgtaagt 120 
atacaattgt actttctttg gattttcata acaaatatac catagactgt taattttatt 18 0 
gaagtttcct taatggaatg agtcattttt gtcttgtgct tttgaggtta cctttgcttt 240 
gacttccaac aatttgatca tatagtgttg agctgtggaa atctttaagt ttattctata 300 
gcaataattt ctattnnnag annccnggnn naaaannann annaaa 34 6 

<210> 4 
<211> 372 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 297, 306, 332 
<223> n = A,T,C or G 

<400> 4 

actagtctca ttactccaga attatgctct tgtacctgtg tggctgggtt tcttagtcgt 60 
tggtttggtt tggttttttg aactggtatg tagggtggtt cacagttcta atgtaagcac 120 
tctcttctcc aagttgtgct ttgtggggac aatcattctt tgaacattag agaggaaggc 18 0 
agttcaagct gttgaaaaga ctattgctta tttttgtttt taaagaccta cttgacgtca 240 
tgtggacagt gcacgtgcct tacgctacat cttgttttct aggaagaagg ggatgcnggg 300 
aaggantggg tgctttgtga tggataaaac gnctaaataa cacaccttta cattttgaaa 360 
aaaacaaaac aa 372 

<210> 5 
<211> 698 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 8, 345, 422, 430, 433, 436, 438, 472, 481, 486, 515, 521, 
536, 549, 553, 556, 557, 559, 568, 593, 597, 605, 611, 613, 
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616, 618, 620, 628, 630, 632, 634, 635, 639, 643, 647, 648, 
649, 652, 654, 658, 664, 690 
<223> n = A,T,C or G 

<400> 5 

actagtanga tagaaacact 
cctaacccag gttaactgca 
gcataaagcc aatgtagtcc 
caatacacac tcatgaactc 
gcacacttgc tagactcaga 
gacaacctac tttgcttggc 
gacatttagt tagtgctttt 
tntccaaatn ttngtncngt 
natgangtcc ctggtttttc 
ctaaaaccnt ctnctnnang 
tgtgngaaga nanccncncn 
gggngccgcc cccgcggggg 

<210> 6 
<211> 740 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_£eature 
<222> 82, 406, 426, 434, 462, 536, 551, 558, 563, 567, 582, 584, 
592, 638, 651, 660, 664, 673, 675, 697, 706, 711, 715, 716, 
717, 72,3, 724, 725, 733 
<223> n = A,T,C or G 

<400> 6 

actagtcaaa aatgctaaaa taatttggga gaaaatattt tttaagtagt gttatagttt 60 

catgtttatc ttttattatg tnttgtgaag ttgtgtcttt tcactaatta cctatactat 120 

gccaatattt ccttatatct atccataaca tttatactac atttgtaaga gaatatgcac 180 

gtgaaactta acactttata aggtaaaaat gaggtttcca agatttaata atctgatcaa 240 

gttcttgtta tttccaaata gaatggactt ggtctgttaa ggggctaagg gagaagaaga 300 

agataaggtt aaaagttgtt aatgaccaaa cattctaaaa gaaatgcaaa aaaaaattta 360 

ttttcaagcc ttcgaactat ttaaggaaag caaaatcatt tcctanatgc atatcatttg 420 

tgagantttc tcantaatat cctgaatcat tcatttcagc tnaggcttca tgttgactcg 480 

atatgtcatc tagggaaagt ctatttcatg gtccaaacct gttgccatag ttggtnaggc 54 0 

tttcctttaa ntgtgaanta ttnacangaa attttctctt tnanagttct tnatagggtt 600 

aggggtgtgg gaaaagcttc taacaatctg tagtgttncg tgttatctgt ncagaaccan 660 

aatnacggat cgnangaagg actgggtcta tttacangaa cgaatnatct ngttnnntgt 720 

gtnnncaact ccngggagcc 74 0 

<210> 7 

<211> 670 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<222> 265, 268, 457, 470, 485, 546, 553, 566, 590, 596, 613, 624, 
639, 653, 659, 661 
<223> n = A,T,C or G 

<400> 7 

gctggggagc tcggcatggc ggtccccgct gcagccatgg ggccctcggc gttgggccag 60 



gtgtcccgag agtaaggaga 
agaagaggcg ggatactttc 
agtttctaag atcatgttcc 
ctgatggaac aataacaggc 
aaaaatacta ctctcataaa 
tgagtgaagg aatgatattc 
tatataccag gcatgatgct 
cgctgcacat atctgaaatc 
cacgccactt gatcngtcaa 
gttagacngg acctctcttc 
cccccctncn tncnncctng 
gacccccccn ttttcccc 



gaagctacta ttgattagag 60 
agctttccat gtaactgtat 12 0 
aagctaactg aatcccactt 180 
ccaagcctgt ggtatgatgt 240 
tgggtgggag tattttgggt 300 
atatnttcat ttattccatg 360 
gagtgacact cttgtgtata 42 0 
ctatattaag antttcccaa 480 
ngatctcacc tctgtntgtc 540 
tcccttcccg aanaatnaag 600 
ccngctnnnc cncntgtngg 660 
698 
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agcggccccg gctcgatggc 
cttgggatgc aggagctgtt 
ccaaggtgca ctcggtggcc 
aagacgccac gtcttcttgc 
catggggata gtgtggacca 
cgtctggaga taaaaccatt 
tgaacactaa aggggagaac 
tagcnacaag gatgatgtgg 
aaacanttcc aanttcgaag 
tcctgacaat ggnccttggg 
natccacccc 

<210> 8 
<211> 689 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 253, 335, 410, 428, 448, 458, 466, 479, 480, 482, 483, 485, 
488, 491, 492, 495, 499, 500, 502, 503, 512, 516, 524, 525, 
526, 527, 530, 540, 546, 550, 581, 593, 594, 601, 606, 609, 
610, 620, 621, 622, 628, 641, 646, 656, 673 
<223> n = A,T,C or G 

<400> 8 

actagtatct aggaatgaac agtaaaagag gagcagttgg ctacttgatt acaacagagt 60 
aaatgaagta ctggatttgg gaaaacctgg ttttattaga acatatggaa tgaaagccta 12 0 
cacctagcat tgcctactta gccccctgaa ttaacagagc . ccaattgaga caaacccctg 18 0 
gcaacaggaa attcaaggga gaaaaagtaa gcaacttggg ctaggatgag ctgactccct 240 
tagagcaaag ganagacagc ccccattacc aaataccatt tttgcctggg gcttgtgcag 300 
ctggcagtgt tcctgcccca gcatggcacc ttatngtttt gatagcaact tcgttgaatt 360 
ttcaccaact tattacttga aattataata tagcctgtcc gtttgctgtn tccaggctgt 420 
gatatatntt cctagtggtt tgactttnaa aataaatnag gtttantttt ctccccccnn 480 
cnntnctncc nntcnctcnn cnntcccccc cnctcngtcc tccnnnnttn gggggggccn 54 0 
cccccncggn ggacccccct ttggtccctt agtggaggtt natggcccct ggnnttatcc 600 
nggccntann tttccccgtn nnaaatgntt ccccctccca ntcccnccac ctcaanccgg 660 
aagcctaagt ttntaccctg ggggtcccc 68 9 

<210> 9 

<211> 674 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<222> 602, 632, 639, 668 
<223> n = A,T,C or G 

<400> 9 

gtccactctc ctttgagtgt actgtcttac tgtgcactct gtttttcaac tttctagata 60 

taaaaaatgc ttgttctata gtggagtaag agctcacaca cccaaggcag caagataact 120 

gaaaaaagcg aggctttttt gccaccttgg taaaggccag ttcactgcta tagaactgct 18 0 

ataagcctga agggaagtag ctatgagact ttccattttt cttagttctc ccaataggct 24 0 

ccttcatgga aaaaggcttc ctgtaataat tttcacctaa tgaattagca gtgtgattat 300 

ttctgaaata agagacaaat tgggccgcag agtcttcctg tgatttaaaa taaacaaccc 360 

aaagttttgt ttggtcttca ccaaaggaca tactctaggg ggtatgttgt tgaagacatt 420 

caaaaacatt agctgttctg tctttcaatt tcaagttatt ttggagactg cctccatgtg 480 



cccgtggtgc tcagtgagca 
ccggggccac agcaagaccg 
tggagttgcg acgggcgtcg 
tgganaanga ccgttggtca 
ctttgttggc atccaagtaa 
cgcatctggg atgtgaggac 
attaatatct gctggantcc 
tgactttatt gatgccaaga 
tcaccnaaat ctcctggaac 
tgtntcacat cctcagctnc 



gcggcccgtc gcgctacgtg 120 
cgagttcctg gcgcacagcg 180 
cctacctcgg ggtcttcgac 24 0 
aagaaaacaa ttatcgggga 30 0 
tcctgaccta tttgttacgg 3 60 
tacaaaatgc attgccactg 42 0 
tgatgggcan accattgctg 480 
aaccccgttc caaagcaaaa 54 0 
aatgaacatn aatatnttct 60 0 
cccaaaactg aancctgtnc 660 
670 
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agttaattac tttgctctgg aactagcatt attgtcatta tcatcacatt ctgtcatcat 54 0 
catctgaata atattgtgga tttccccctc tgcttgcatc ttcttttgac tcctctggga 60 0 
anaaatgtca aaaaaaaagg tcgatctact cngcaaggnc catctaatca ctgcgctgga 660 
aggacccnct gccc 67 4 



<210> 10 

<211> 346 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 320, 321, 322, 325, 326, 328, 329, 330, 332, 333, 334, 335, 
342 

<223> n = A, T, C or G 



<400> 10 

actagtctgc tgatagaaag cactatacat 
ttctgtctgt aacaaaaatg tactttatag 
ccttaagtgt ttctgtcatt gttcaagtgt 
tttttctttt ccccttataa attgtaattc 
tgtcagatta tattatctaa caattgaata 
aaagggtact tttctattan nnagnngnnn 



cctattgttt ctttctttcc aaaatcagcc 60 
agatggagga aaaggtctaa tactacatag 120 
attttctgta acagaaacat atttggaatg 180 
ctgaaatact gctgctttaa aaagtcccac 240' 
ttgtaaatat acttgtctta cctctcaata 300 
gnnnnataaa anaaaa 34 6 



<210> 11 

<211> 602 

<212> DNA 

<213> Homo sapiens 



<400> 11 

actagtaaaa agcagcattg ccaaataatc 
gatgttaagc tttttgaaaa gtttaggtta 
tgcttccctt tatctggaat gtggcattag 
ttcaattcca tgacttaagg ttggagagct 
cagttttgca taattataat cggcattgta 
atctgcactt tctaaatatc aaaaaaggga 
ctgtttgaaa catgagtttt atttgcttaa 
tcttgggatc ctgtgtagaa ctgttctcat 
gtactagcta caaattcggt ttcatattct 
ctagatggtc tacttctgtt catataaaaa 
aa 



cctaattttc cactaaaaat ataatgaaat 60 
aacctactgt tgttagatta atgtatttgt 120 
cttttttatt ttaaccctct ttaattctta 180 
aaacactggg atttttggat aacagactga 240 
catagaaagg atatggctac cttttgttaa 300 
aatgaagtta taaatcaatt tttgtataat 3 60 
tattagggct ttgccccttt tctgtaagtc 420 
taaacaccaa acagttaagt ccattctctg 480 
acttaacaat ttaaataaac tgaaatattt 540 
caaaacttga tttccaaaaa aaaaaaaaaa 600 
602 



<210> 12 

<211> 685 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 170, 279, 318, 321, 322, 422, 450, 453, 459, 467, 468, 470, 
473, 475, 482, 485, 486, 491, 498, 503, 506, 509, 522, 526, 
527, 528, 538, 542, 544, 551, 567, 568, 569, 574, 576, 582, 
587, 588, 589, 590, 592, 593, 598, 599, 603, 605, 608 
<223> n = A,T,C or G 

<221> misc_feature 

<222> 633, 634, 635, 644, 646, 648, 651, 655, 660, 662, 663, 672, 
674, 675, 682, 683 
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<223> n = A,T,C or G 
<400> 12 

actagtcctg tgaaagtaca actgaaggca 
attatcatgg tattgatgga cctaagaaaa 
gcatgcattt gtaacatgat tagtagattt 
aggtgtttta tcattatgta aaggaattaa 
atatgcatat agtagagtgc aaaaatatag 
tttagatatg ccttaatnta nnaactgtgc 
agaccagtgc ctgggtggtg cctccccttg 
angtagtgcc ctcgtaggtg tcacgtggan 
ancanngtga nagtttcncc gtngangcng 
cntntccaat ngacaatcga gtttccnnnc 
cantntgnta accccgcgcc cggatcgctc 
cnnccgccgt cncrmccccg cnncc 

<210> 13 
<211> 694 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 503, 546, 599, 611, 636, 641, 643, 645, 656, 658, 662, 676, 
679, 687 

<223> n = A,T,C or G 
<400> 13 

cactagtcac tcattagcgt tttcaatagg gctcttaagt ccagtagatt acgggtagtc 60 

agttgacgaa gatctggttt acaagaacta attaaatgtt tcattgcatt tttgtaagaa 120 

cagaataatt ttataaaatg tttgtagttt ataattgccg aaaataattt aaagacactt 180 

tttctctgtg tgtgcaaatg tgtgtttgtg atccattttt tttttttttt taggacacct 240 

gtttactagc tagctttaca atatgccaaa aaaggatttc tccctgaccc catccgtggt 300 

tcaccctctt ttccccccat gctttttgcc ctagtttata acaaaggaat gatgatgatt 360 

taaaaagtag ttctgtatct tcagtatctt ggtcttccag aaccctctgg ttgggaaggg 420 

gatcattttt tactggtcat ttccctttgg agtgtactac tttaacagat ggaaagaact 480 

cattggccat ggaaacagcc gangtgttgg gagccagcag tgcatggcac cgtccggcat 540 

ctggcntgat tggtctggct gccgtcattg tcagcacagt gccatgggac atggggaana 600 

ctgactgcac ngccaatggt tttcatgaag aatacngcat ncncngtgat cacgtnancc 660 

angacgctat gggggncana gggccanttg cttc 694 

<210> 14 

<211> 679 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 29, 68, 83, 87, 94, 104, 117, 142, 145, 151, 187, 201, 211, 
226, 229, 239, 241, 245, 252, 255, 259, 303, 309, 359, 387, 
400, 441, 446, 461, 492, 504, 505, 512, 525, 527, 533, 574, 
592, 609, 610, 618, 620, 626, 627, 633, 639, 645, 654 
<223> n = A,T,C or G 

<400> 14 

cagccgcctg catctgtatc cagcgccang tcccgccagt cccagctgcg cgcgcccccc 60 
agtcccgnac ccgttcggcc cangctnagt tagncctcac catnccggtc aaaggangca 120 
ccaagtgcat caaatacctg cngtncggat ntaaattcat cttctggctt gccgggattg 18 0 



gaaagtgtta ggattttgca tctaatgttc 60 
taaaaattag actaagcccc caaataagct 120 
gaatatatag atgtagtatn ttgggtatct 18 0 
agtaaaggac tttgtagttg tttttattaa 24 0 
caaaaatana aactaaaggt agaaaagcat 300 
caggtggccc tcggaataga tgccaggcag 360 
tctgcccccc tgaagaactt ccctcacgtg 42 0 
tantggganc aggccgnncn gtnanaagaa 48 0 
aactgtccct gngccnnnac gctcccanaa 540 
tccngnaacc tngccgnnnn cnngcccnnc 600 
tcnnntcgtt ctcncncnaa ngggntttcn 660 
685 
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ctgtccntgc cattggacta nggctccgat ncgactctca gaccanganc atcttcganc 24 0 
naganactaa tnatnattnt tccagcttct acacaggagt ctatattctg atcggatccg 300 
gcnccctcnt gatgctggtg ggcttcctga gctgctgcgg ggctgtgcaa gagtcccant 360 
gcatgctggg actgttcttc ggcttcntct tggtgatatn cgccattgaa atacctgcgg 42 0 
ccatctgggg atattccact ncgatnatgt gattaaggaa ntccacggag ttttacaagg 480 
acacgtacaa cnacctgaaa accnnggatg anccccaccg ggaancnctg aangccatcc 54 0 
actatgcgtt gaactgcaat ggtttggctg gggnccttga acaatttaat cncatacatc 600 
tggccccann aaaggacntn ctcganncct tcnccgtgna attcngttct gatnccatca 660 
cagaagtctc gaacaatcc 67 9 



<210> 15 

<211> 695 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 105, 172, 176, 179, 189, 203, 212, 219, 221, 229, 231, 238, 
242, 261, 266, 270, 278, 285, 286, 298, 311, 324, 337, 350, 
363, 384, 391, 395, 405, 411, 424, 427, 443, 448, 453, 455, 
458, 463, 467, 470, 479, 482, 484, 493, 499, 505, 518 
<223> n = A,T,C or G 

<221> misc_feature 

<222> 520, 523, 531, 540, 584, 595, 597, 609, 611, 626, 628, 651, 
652, 657, 661, 665, 669, 672, 681, 683, 691, 693 
<223> n = A,T,C or G 



<400> 15 

actagtggat aaaggccagg gatgctgctc 
cattacaact acccaatccg aagtgtcaac 
ttaaaaaagg gcctgaaaaa aggggagcca 
tggcaaatna gcattctgtc tcnttggctg 
cnggcccagg aatacatctc ncaatnaacn 
tgggattatc ntccgcttgt tgancttcta 
ccnagttctg ttagaaaaat gccngaattc 
tctncanaaa cttcctggcc acnattcnaa 
ancncacccc acntttgana gccangacaa 
aactttgaaa ggaaaaaaaa ctttgtttcc 
tgccttctng naaccctgga agcccngnga 
ncttnaatnt cnatcttccc nanaacgatt 

<210> 16 

<211> 669 

<212> DNA 

<213> Homo sapiens 



aacctcctac catgtacagg gacgtctccc 60 
tgtgtcagga ctaanaaacc ctggttttga 120 
caaatctgtc tgcttcctca cnttantcnt 180 
cngcctcanc ncaaaaaanc ngaactcnat 24 0 
aaattganca aggcnntggg aaatgccnga 300 
agtttcnttc ccttcattcn accctgccag 360 
naacnccggt tttcntactc ngaatttaga 420 
ttnanggnca cgnacanatn ccttccatna 480 
tgactgcntn aantgaaggc ntgaaggaan 54 0 
ggccccttcc aacncttctg tgttnancac 600 
cagtgttaca tgttgttcta nnaaacngac 660 
ncncc 695 



<220> 

<221> misc_feature 

<222> 299, 354, 483, 555, 571, 573, 577, 642, 651, 662, 667 
<223> n = A, T, C or G 



<400> 16 

cgccgaagca gcagcgcagg ttgtccccgt 
ttcccgggcc ccttacactc cacagtcccg 
agaaccctgc ggaggagacc ggcgaggaga 
tgcctgagag agctgaagag gcaaagctaa 
ctggaggctc cgacttcctc atgaagagac 



ttcccctccc ccttcccttc tccggttgcc 60 
gtcccgccat gtcccagaaa caagaagaag 12 0 
agcaggacac gcaggagaaa gaaggtattc 180 
aggccaaata cccaagccta ggacaaaagc 240 
tccagaaagg gcaaaagtac tttgactcng 300 
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gagactacaa catggccaaa gccaacatga agaataagca gctgccaagt gcangaccag 3 60 
acaagaacct ggtgactggt gatcacatcc ccaccccaca ggatctgccc agagaaagtc 42 0 
ctcgctcgtc accagcaagc ttgcgggtgg ccaagttgaa tgatgctgcc ggggctctgc 480 
canatctgag acgcttccct ccctgcccca cccgggtcct gtgctggctc ctgcccttcc 54 0 
tgcttttgca gccangggtc aggaagtggc ncnggtngtg gctggaaagc aaaacccttt 600 
cctgttggtg tcccacccat ggagcccctg gggcgagccc angaacttga ncctttttgt 660 
tntcttncc 669 



<210> 17 

<211> 697 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 33, 48, 50, 55, 59, 60, 76, 77, 78, 90, 113, 118, 130, 135, 
141, 143, 150, 156, 166, 167, 170, 172, 180, 181, 190, 192, 
194, 199, 201, 209, 212, 224, 225, 226, 230, 233, 234, 236, 
242, 244, 251, 253, 256, 268, 297, 305, 308, 311, 314 
<223> n = A,T,C or G 

<221> misc_feature 

<222> 315, 317, 322, 324, 327, 333, 337, 343, 362, 364, 367, 368, 
373, 384, 388, 394, 406, 411, 413, 423, 429, 438, 449, 450, 
473, 476, 479, 489, 491, 494, 499, 505, 507, 508, 522, 523, 
527, 530, 533, 535, 538, 539, 545, 548, 550, 552, 555 
<223> n = A,T,C or G 

<221> misc_feature 

<222> 562, 563, 566, 568, 572, 577, 578, 580, 581, 591, 594, 622, 
628, 632, 638, 642, 644, 653, 658, 662, 663, 665, 669, 675, 
680, 686, 689 
<223> n = A, T, C or G 



<400> 17 

gcaagatatg gacaactaag tgagaaggta 
gacgcgctga ggagannnac gctggcccan 
gcctgcccan gggancccca ncnctcggan 
ncctggctcn cncngcccng nccagctcnc 
cncnccctcc ncnacnacct cctacccncg 
ccacnacncc ntcnncncga ancnccnctc 
cncnacnncg cgntcccccg cgcncgcngc 
agncacgcnc tccgcccnct gacgccccim 
ccccgctcnc nccnctgcnc gccgncnngg 
ccccngcngn angcngtgcg cnncangncc 
cgcccgctgg gggctcccgc cncgcggntc 
cnncnctcnc gctcngcgcn cgcccnccnc 

<210> 18 

<211> 670 " 

<212> DNA 

<213> Homo sapiens 



atnctctact gctctagntn ctccnggcnn 60 
ctgccggcca cacacgggga tcntggtnat 120 
cccatntcac acccgnnccn tncgcccacn 180 
gnccccctcc gccnnnctcn ttnncntctc 24 0 
gctccctccc cagccccccc ccgcaancct 300 
gcnctcngcc ccngccccct gccccccgcc 360 
ctcnccccct cccacnacag ncncacccgc 420 
cccgccgcgc tcaccttcat ggnccnacng 480 
cgccccgccc cnnccgngtn ccncncgnng 54 0 
gngccgnncn ncaccctccg nccnccgccc 600 
antccccncc cntncgccca ctntccgntc 660 
ccccccc 697 



<220> 

<221> misc_feature 

<222> 234, 292, 329, 437, 458, 478, 487, 524, 542, 549, 550, 557, 
576, 597, 603, 604, 646, 665 
<223> n = A,T,C or G 
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<400> 18 

ctcgtgtgaa gggtgcagta cctaagccgg agcggggtag aggcgggccg gcaccccctt 60 
ctgacctcca gtgccgccgg cctcaagatc agacatggcc cagaacttga acgacttggc 12 0 
gggacggctg cccgccgggc cccggggcat gggcacggcc ctgaagctgt tgctgggggc 18 0 
cggcgccgtg gcctacggtg tgcgcgaatc tgtgttcacc gtggaaggcg ggcncagagc 24 0 
catcttcttc aatcggatcg gtggagtgca caggacacta tcctgggccg anggccttca 300 
cttcaggatc cttggttcca gtaccccanc atctatgaca ttcgggccag acctcgaaaa 360 
aatctcctcc ctacaggctc caaagaccta cagatggtga atatctccct gcgagtgttg 42 0 
tctcgaccaa tgctcangaa cttcctaaca tgttccancg cctaagggct ggactacnaa 48 0 
gaacgantgt tgccgtccat tgtcacgaag tgctcaagaa tttnggtggc caagttcaat 54 0 
gncctcacnn ctgatcnccc agcggggcca agttanccct ggttgatccc cgggganctg 60 0 
acnnaaaagg gccaaggact tcccctcatc ctggataatg tggccntcac aaagctcaac 660 
tttanccacc 670 



<210> 19 

<211> 606 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 506 

<223> n = A,T,C or G 



<400> 19 

actagtgcca acctcagctc ccaggccagt 
tggcctcagt tgtccttggt tattgatggg 
tgtcgccttg gctcaactgt ggttgatttg 
ccaggctgtg ccctggaaag tactacagcc 
tcacatgcgt cctacctgtg aaactctggg 
tactatgtgt ctgtccactg acgactgtca 
gggcactagc ctgactttta aggcagtgtg 
gagctgctgg tttagccttg cacctgggga 
cagccaaaag ctgaatggaa aagttnagaa 
tcttctgtct gttttgtttt tcaattgaaa 
gagacc 



tctctgaatg tcgaggagtt ccaggatctc 60 
ggacaaattg gggatggcca gagccccgag 12 0 
tctgtgcccg gaaagtttgg catcattcgt 180 
atcctccaac agaagtacgg actgctcccc 240 
aagcaggaag gcccaagacc tggtgctgga 300 
aggcctcatt tgcagaggcc accggagcta 360 
tctttctgag cactgtagac caagcccttg 420 
aaggatgtat ttatttgtat tttcatatat 480 
cattcctagg tggccttatt ctaataagtt 54 0 
agttattaaa taacagattt agaatctagt 600 
606 



<210> 20 

<211> 449 

<212> DNA 

<213> Homo sapiens 



<400> 20 

actagtaaac aacagcagca gaaacatcag 
cagcgccaga gccgaggaga acccccgctc 
ccaccacagc cgcctgccag gatggactcg 
tgccagaaca tcaaggagtt cactgcccaa 
cttcaagaat acaacaacta agaaaaggaa 
tgaagtcaca ccagggcaac tcttggaaga 
atttctttag tgtcattgcc gattttggct 
aaaacaaaat cttgactgct tgctcaaaa 

<210> 21 

<211> 409 

<212> DNA 

<213> Homo sapiens 



tatcagcagc gtcgccagca ggagaatatg 60 
cctgaggagg acctgtccaa actcttcaaa 12 0 
ctgctcattg caggccagat aaacacttac 18 0 
aacttaggca agctcttcat ggcccaggct 24 0 
gtttccagaa aagaagttaa catgaactct 300 
aatatatttg catattgaaa agcacagagg 3 60 
ataacagtgt ctttctagcc ataataaaat 42 0 
449 
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<400> 21 

tatcaatcaa ctggtgaata attaaacaat gtgtggtgtg atcatacaaa gggtaccact 60 
caatgataaa aggaacaagc tgcctatatg tggaacaaca tggatgcatt tcagaaactt 12 0 
tatgttgagt gaaagaacaa acacggagaa catactatgt ggttctcttt atgtaacatt 180 
acagaaataa aaacagaggc aaccaccttt gaggcagtat ggagtgagat agactggaaa 24 0 
aaggaaggaa ggaaactcta cgctgatgga aatgtctgtg tcttcattgg gtggtagtta 300 
tgtggggata tacatttgtc aaaatttatt gaactatata ctaaagaact ctgcatttta 360 
ttgggatgta aataatacct caattaaaaa gacaaaaaaa aaaaaaaaa 409 

<210> 22 

<211> 649 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<222> 263, 353, 610, 635, 646 

<223> n = A, T, C or G 



<400> 22 

acaattttca ttatcttaag cacattgtac 
tgataaggat ggtacttgca tatggtgaat 
tatttcagtg gaccaacatt gtggcatggc 
caaatctaca agagaccctg gttggttttt 
tcctgaatca gcagggatgg aangagggta 
agctctgaag tgtcacattt aatatcagtt 
aagagagaag aaagaggaag tgttcacttt 
ttatatcagt agttctgagg tattgatagc 
gttgaagcag ggtgaataac taggggcata 
gatgttttct ttggaatttc cggataagtt 
ctgaagttcn tatccatctc attacaacaa 

<210> 23 

<211> 669 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 642, 661 
<223> n = A,T,C or G 



atttctacag aacctgtgat tattctcgca 60 
tactactgtt gacagtttcc gcagaaatcc 120 
agcaaatgcc aacattttgt ggaatagcag 180 
cgttttgttt tctttgtttt ttcccccttc 240 
gggaagttat gaattactcc ttccagtagt 300 
ttttttaaac atgattctag ttnaatgtag 360 
tttaatacac tgatttagaa atttgatgtc 420 
ttgctttatt tctgccttta cgttgacagt 480 
tatatttttt ttttttgtaa gctgtttcat 54 0 
caggaaaaca tctgcatgtt gttatctagt 600 
aaacncccag aacggnttg 64 9 



<400> 23 

actagtgccg tactggctga aatccctgca 
tactctcagt caccagctct ggaattagat 
tatcctctga cagcctttgg gctgcctcgg 
tcacctgtcg tgcccccctc tgtcaagact 
cgcaaggtgg tgctgatgca gtgcaacatt 
ctgacacttc tgctgaagtt ggaggacaaa 
ccaaatgaga atatccccga gttggcggct 
gctgaccaga gccggttgac ttctctgcta 
ggaacagtac cctcaactca gccgctgtca 
gccctgatct gcgctgtggc tgtcctggac 
agtattacct gtgaagccct tccctccttt 
nttctaacc 



ggaccaggaa gagaaccagt tcagactttg 60 
aaattccttg aagatgtcag gaatgggatc 120 
ccccagcagc cacagcagga ggaggtgaca 180 
ccgacacctg aaccagctga ggtggagact 240 
gagtcggtgg aggagggagt caaacaccac 300 
ctgaaccggc acctgagctg tgacctgatg 360 
gagctggtgc agctgggctt cattagtgag 420 
gaagagactt gaacaagttc aattttgcca 480 
ccgtctcctc ttagagctca ctcgggccag 54 0 
gtgctgcacc ctctgtcctt ccccccagtc 600 
attattcagg anggctgggg gggctccttg 660 
669 



<210> 24 
<211> 442 
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<212> DNA 

<213> Homo sapiens 



<400> 24 

actagtacca tcttgacaga ggatacatgc tcccaaaacg tttgttacca cacttaaaaa 60 
tcactgccat cattaagcat cagtttcaaa attatagcca ttcatgattt actttttcca 120 
gatgactatc attattctag tcctttgaat ttgtaagggg aaaaaaaaca aaaacaaaaa 18 0 
cttacgatgc acttttctcc agcacatcag atttcaaatt gaaaattaaa gacatgctat 240 
ggtaatgcac ttgctagtac tacacacttt ggtacaacaa aaaacagagg caagaaacaa 300 
cggaaagaga aaagccttcc tttgttggcc cttaaactga gtcaagatct gaaatgtaga 360 
gatgatctct gacgatacct gtatgttctt attgtgtaaa taaaattgct ggtatgaaat 420 
gacctaaaaa aaaaaaaaga aa 442 

<210> 25 

<211> 656 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 330, 342, 418, 548, 579, 608 
<223> n = A,T,C or G 



<400> 25 

tgcaagtacc acacactgtt tgaattttgc 
ccccggaatg tacagtgtct tggtgcacca 
accctaatgg ggcagagagt atagccctag 
aggcctgagg tagaggggag tggtatgtgt 
gacaggatgt tagataaagg ctctagttag 
ctcctagcag ctggtaaagg ggtgctggan 
gggctgatct gattacttcc tggcatcccg 
atgggacagt tttccatatc cttgctgtgg 
attaaaaatc actgccctaa ctacacttcc 
tgacatantt cttggcatgg ggagccagcc 
ctcctganac tcatctacat agaattggtt 

<210> 26 
<211> 434 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 395 

<223> n = A,T,C or G 



acaaaaagtg actgtaggat caggtgatag 60 
agatgccttc taaaggctga cataccttgg 120 
cccagtggtg acatgaccac tccctttggg 18 0 
tttctcagtg gaagcagcac atgagtgggt 240 
ggtgtcattg tcatttgaga gactgacaca 300 
gccatggagg anctctagaa acattagcat 360 
ctcactttta tgggaagtct tattagangg 420 
agctctggaa cactctctaa atttccctct 480 
tccttgaagg aatagaaatg gaactttctc 540 
acaaatgana atctgaacgt gtccaggttt 600 
aaaccctccc ttggaataag gaaaaa 656 



<400> 26 

actagttcag actgccacgc caaccccaga 
ctaggtgttt ccatctatgt ttcaatctgt 
acaaaaaaac gctgccaggt tttagaagca 
caccagggtt cttttgaaat agtaccacat 
aataactgaa ttgtcaggct ttgattgata 
gaataagtta taatcagtat tcatctcttt 
gtcatttgta ctgtttgaaa aatatttctt 
aaaaaaaaaa aaaa 



aaatacccca catgccagaa aagtgaagtc 60 
ccatctacca ggcctcgcga taaaaacaaa 120 
gttctggtct caaaaccatc aggatcctgc 18 0 
gtaaaaggga atttggcttt cacttcatct 240 
attgtagaaa taagtagcct tctgttgtgg 300 
gttttttgtc actcttttct ctctaattgt 360 
ctatnaaatt aaactaacct gccttaaaaa 42 0 
434 



<210> 27 
<211> 654 



PCT/US01/47576 



<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 505, 533, 563, 592, 613, 635, 638 
<223> n = A,T,C or G 



<400> 27 

actagtccaa cacagtcaga aacattgttt 
taataaacca ggatccattt aggtaccact 
tttatactgc atcctttaca ttagccacta 
cagaatccta tggattgcag catttcactt 
gcagtttctc aaaagcagaa acatgccgcc 
gaatgtaagg gcagctggcc cccaatgtgg 
ttcttgttcg cggctaaatg acagtttctg 
gtgttgattt acaaagaggc cagctaatag 
attcaagctg tgagccaggc agganctcag 
ggtacaaaaa aaattttaaa gcntttatgt 
aattgttaag aanaatttta agtgtccaga 

<210> 28 

<211> 670 

<212> DNA 

<213> Homo sapiens 



tgaatcctct gtaaaccaag gcattaatct 60 
tgatataaaa aggatatcca taatgaatat 120 
aatacgttat tgcttgatga agacctttca 18 0 
ggctacttca tacccatgcc ttaaagaggg 24 0 
agttctcaag ttttcctcct aactccattt 300 
ggaggtccga acattttctg aattcccatt 360 
tcattactta gattccgatc tttcccaaag 420 
cagaaatcat gaccctgaaa gagagatgaa 480 
tatggcaaag gtcttgagaa tcngccattt 54 0 
tataccatgg aaccatagaa anggcaaggg 600 
cccanaanga aaaaaaaaaa aaaa 654 



<220> 

<221> misc_feature 

<222> 101, 226, 274, 330, 385, 392, 397, 402, 452, 473, 476, 532, 
534, 538, 550, 583, 595, 604, 613, 622, 643, 669 
<223> n = A, T, C or G 



<400> 28 

cgtgtgcaca tactgggagg atttccacag ctgcacggtc acagccctta cggattgcca 60 
ggaaggggcg aaagatatgt gggataaact gagaaaagaa nccaaaaacc tcaacatcca 120 
aggcagctta ttcgaactct gcggcagcgg caacggggcg gcggggtccc tgctcccggc 180 
gttcccggtg ctcctggtgt ctctctcggc agctttagcg acctgncttt ccttctgagc 240 
gtggggccag ctccccccgc ggcgcccacc cacnctcact ccatgctccc ggaaatcgag 300 
aggaagatca ttagttcttt ggggacgttn gtgattctct gtgatgctga aaaacactca 360 
tatagggaat gtgggaaatc ctganctctt tnttatntcg tntgatttct tgtgttttat 42 0 
ttgccaaaat gttaccaatc agtgaccaac cnagcacagc caaaaatcgg acntcngctt 48 0 
tagtccgtct tcacacacag aataagaaaa cggcaaaccc accccacttt tnantttnat 540 
tattactaan ttttttctgt tgggcaaaag aatctcagga acngccctgg ggccnccgta 600 
ctanagttaa ccnagctagt tncatgaaaa atgatgggct ccncctcaat gggaaagcca 660 
agaaaaagnc 670 

<210> 29 
<211> 551 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 336, 474, 504, 511, 522, 523, 524, 540, 547 
<223> n = A,T,C or G 



<400> 29 

actagtcctc cacagcctgt gaatccccct agacctttca agcatagtga gcggagaaga 60 



i 
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agatctcagc gtttagccac cttacccatg cctgatgatt ctgtagaaaa ggtttcttct 120 
ccctctccag ccactgatgg gaaagtattc tccatcagtt ctcaaaatca gcaagaatct 180 
tcagtaccag aggtgcctga tgttgcacat ttgccacttg agaagctggg accctgtctc 24 0 
cctcttgact taagtcgtgg ttcagaagtt acagcaccgg tagcctcaga ttcctcttac 300 
cgtaatgaat gtcccagggc agaaaaagag gatacncaga tgcttccaaa tccttcttcc 360 
aaagcaatag ctgatgggaa gaggagctcc agcagcagca ggaatatcga aaacagaaaa 42 0 
aaaagtgaaa ttgggaagac aaaagctcaa cagcatttgg taaggagaaa aganaagatg 480 
aggaaggaag agagaagaga gacnaagatc nctacggacc gnnncggaag aagaagaagn 54 0 
aaaaaanaaa a 551 



<210> 30 

<211> 684 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 545, 570, 606, 657, 684 

<223> n = A,T,C or G 



<400> 30 

actagttcta tctggaaaaa gcccgggttg 
cgagactcat ttcttggaag catccctggc 
gtgatagaac ctggactgct ttttgagata 
agcacctctc agttgaatga attaatgatg 
ccacgagaga tgactgcaga tgtaatcgag 
ggtggtgata ttcgtgaaga gtcttcctat 
aaatgccccc gttgttggaa gtatacagcg 
tgcagaagtt gtcagtggga aaatagtatt 
cagtactggg ctagaagttt ggatggatta 
aggtnatgag tggatgagta aatggtggan 
aagttnttcc tgttactata gaaaggaatt 
tgtggtgtgt accgtggatg gaan 

<210> 31 
<211> 654 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 326, 582, 651 
<223> n = A,T,C or G 



gaagaagctg tggagagtgc gtgtgcaatg 60 
aaaaatgcag ctgagtacaa ggttatcact 120 
atagagatgc tgcagtctga agagacttcc 180 
gcttctgagt caactttact ggctcaggaa 240 
cttaaaggga aattcctcat caacttagaa 300 
aaagtaattg tcatgccgac tacgaaagaa 360 
ggagtcttca gatacactgt gtcctcgatg 420 
aacagctcac tcgagcaaga accctcctga 480 
tttacaatat aggaaagaaa gccaagaatt 54 0 
gatggggaat tcaaatcaga attatggaag 600 
atgtttattt acatgcagaa aatatanatg 660 
684 



<400> 31 

gcgcagaaaa ggaaccaata tttcagaaac 
aacatcttct cagaatgacc cagaagttat 
tttggcagct gtgctttcca gagatggaag 
agagcctgac agaatagttg gagaattcct 
ccttggtctt ggagatacag tggaaggtct 
tcatgatcag ggaaagcaaa tcagangttc 
aagtgcagag tggaagagct ttccatcacg 
ctatggcaga gcccaatgca aagtttattg 
atgatgttgt gatgggagtt cagtacaagg 
catgctccac tgactgttgt tgcagatggg 
tcaataaagt ttctgtatca ctcatttggt 



aagcttaata ggaacagctg cctgtacatc 60 
catcgtggga gctggcgtgc ttggctctgc 120 
aaaggtgaca gtcattgaga gagacttaaa 180 
gcagccgggt ggttatcatg ttctcaaaga 240 
tgatgcccag gttgtaaatg gttacatgat 300 
agattcctta ccctctgtca gaaaacaatc 3 60 
gaagattcat catgagtctc cggaaagcag 42 0 
aaggtgttgt gttacagtta ttagaggaag 48 0 
ataaagagac tgggagatat caaggaactc 54 0 
cttttctcca anttcaggaa aagcctggtc 600 
tggcttctta tgaagaatgc nccc 654 



<210> 32 
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<211> 673 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_£eature 
<222> 376, 545, 627 
<223> n = A,T,C or G 



<400> 32 

actagtgaag aaaaagaaat tctgatacgg 
tatcacctga caccaggagt tttcattgga 
ttaaagacca cacaaggaag caaaatcttt 
aatgaattga aatcaaaaga atctgacatc 
gataaactcc tctatccagc agacacacct 
aataaattaa tcaaatacat ccaaattaag 
cccgtgactg tctatnagcc aattattaaa 
tgtgggaaat aactgaaaaa gagaccgaga 
atacctagga tttctactgg aggtggagaa 
aagangtccc aaggtcacca aattcattga 
gaaattaaaa gacgcttcag ggagacnccc 
cagggattag aaa 



gacaaaaatg ctcttcaaaa catcattctt 60 
aaaggatttg aacctggtgt tactaacatt 12 0 
ctgaaagaag taaatgatac acttctggtg 180 
atgacaacaa atggtgtaat tcatgttgta 240 
gttggaaatg atcaactgct ggaaatactt 300 
tttgttcgtg gtagcacctt caaagaaatc 360 
aaatacacca aaatcattga tgggagtgcc 42 0 
agaacgaatc attacaggtc ctgaaataaa 480 
acagaagaac tctgaagaaa ttgttacaag 54 0 
aggtggtgat ggtctttatt tgaagatgaa 600 
catgaaggaa ttgccagcca caaaaaaatt 660 
673 



<210> 33 
<211> 673 
<212> DNA 
<213> Homo sapiei 



<220> 

<221> misc_feature 

<222> 325, 419, 452, 532, 538, 542, 571, 600, 616, 651, 653, 672 
<223> n = A,T,C or G 



<400> 33 

actagttatt tactttcctc cgcttcagaa 
ggatctgttg tttcttttgg gtctcacctc 
gaaggttgaa aggagcaggg aaaagatcca 
tcttgaagta tgatgcatat tgcattattt 
atcatttaga agggcaagtt caagaggata 
tgactaaaaa tgaacattaa tgttnaagac 
tgaaattatg caactttgat atcatattcc 
gaaactttat aaagcatatg gtcagttatt 
ctgcacttaa agaagtctaa cagtacaaat 
tntattttta aatattgtac tatttatggt 
aatttatcat ttcaanggca ttctatttgg 
ttcgctactg tnt 



ggtttttcag actgagagcc taagcatact 60 
atcagtgtgc atagtggcag aaattataaa 12 0 
gaagcatgtt agttcgacat catcatcttt 18 0 
tatttgcaaa ctaggaattg cagtctgagg 240 
tgaagatttg agaacttttt aactattcat 300 
ttaagacttt aacctgctgg cagtcccaaa 360 
ttgatttaaa ttgggctttt gtgattgant 420 
tnattaaaaa ggcaaaacct gaaccacctt 48 0 
acctatctat cttagatgga tntatttntt 54 0 
nggtggggct ttcttactaa tacacaaatn 600 
gtttagaagt tgattccaag nantgcatat 660 
673 



<210> 34 
<211> 684 
<212> DNA 
<213> Homo sapie: 



<220> 

<221> misc_feature 

<222> 414, 472, 480, 490, 503, 507, 508, 513, 523, 574, 575, 598, 

659, 662, 675 

<223> n = A,T,C or G 
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<400> 34 

actagtttat tcaagaaaag aacttactga ttcctctgtt cctaaagcaa gagtggcagg 60 
tgatcagggc tggtgtagca tccggttcct ttagtgcagc taactgcatt tgtcactgat 120 
gaccaaggag gaaatcacta agacatttga gaagcagtgg tatgaacgtt cttggacaag 18 0 
ccacagttct gagccttaac cctgtagttt gcacacaaga acgagctcca cctccccttc 240 
ttcaggagga atctgtgcgg atagattggc tggacttttc aatggttctg ggttgcaagt 300 
gggcactgtt atggctgggt atggagcgga cagccccagg aatcagagcc tcagcccggc 3 60 
tgcctggttg gaaggtacag gtgttcagca ccttcggaaa aagggcataa agtngtgggg 42 0 
gacaattctc agtccaagaa gaatgcattg accattgctg gctatttgct tncctagtan 480 
gaattggatn catttttgac cangatnntt ctnctatgct ttnttgcaat gaaatcaaat 540 
cccgcattat ctacaagtgg tatgaagtcc tgcnnccccc agagaggctg ttcaggcnat 600 
gtcttccaag ggcagggtgg gttacaccat tttacctccc ctctcccccc agattatgna 660 
cncagaagga atttntttcc tccc 684 

<210> 35 

<211> 614 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 17, 20, 152, 223, 267, 287, 304, 306, 316, 319, 321, 355, 
365, 382, 391, 407, 419, 428, 434, 464, 467, 477, 480, 495, 
499, 505, 515, 516, 522, 524, 527, 542, 547, 549, 567, 572, 
576, 578 

<223> n = A,T,C or G 



<400> 35 

actagtccaa cgcgttngcn aatattcccc 
ggtaagatcg agcaatggct tcaggacatg 
tcactgcatg aagactggct tgtctcagtg 
cacacctcgc tccctgttag tgccgtatga 
acggtttctc tgtggtcaat gttggtnggc 
aagncncgtg agcagncanc nccagttctg 
ttccngtttc tcctggccct gngtgggcta 
gaaggganga taantgggat ctaccaattg 
tgctttatgt ggganacana tctanctctc 
gntcgancnc gtcttcgatt ttcgganaca 
aaaaaaaaaa aaaa 

<210> 36 

<211> 686 

<212> DNA 

<213> Homo sapiens 



tggtagccta cttccttacc cccgaatatt 60 
ggttctcttc tcctgtgatc attcaagtgc 12 0 
tntcaacctc accagggctg tctcttggtc 180 
cagcccccat canatgacct tggccaagtc 24 0 
tgattggtgg aaagtanggt ggaccaaagg 300 
caccagcagc gcctccgtcc tactngggtg 3 60 
nggcctgatt cgggaanatg cctttgcang 420 
attctggcaa aacnatntct aagattnttn 48 0 
atttnntgct gnanatnaca ccctactcgt 540 
cnccantnaa tactggcgtt ctgttgttaa 600 
614 



<220> 

<221> misc_feature 

<222> 222, 224, 237, 264, 285, 548, 551, 628, 643, 645, 665, 674 
<223> n = A,T,C or G 



<400> 36 

gtggctggcc cggttctccg cttctcccca 
ctccctcgtc gactgttgct tgctggtcgc 
taacctcggt gccaccggat tgcccttctt 
gggcgggggc ctggagcagc ccgaggcact 
ctcagctcgc cagtccggtc gctngcttcc 
acctgctctg ggcacacgcg acccgtggtt 
ggtatttctt aatcagcgct tgcaaagatg 



tcccctactt tcctccctcc ctccctttcc 60 
agactccctg acccctccct cacccctccc 12 0 
ttcctgttgc ccagcccagc cctagtgtca 180 
gcagcagaag ananaaaaga cacgacnaac 24 0 
cgccgcatgg caatnagaca gacgccgctc 300 
gatttggcct tcagtggcat cacccttatg 360 
gttaacctat gctacgccag ggagatacag 420 
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gagactggat tggaacattt ttggggtcta aaggtctgtt tggggtgcaa cactgaataa 480 

ggatgccacc aaagcagcta cagcagctgc agatttcaca gcccaagtgt gggatgctgt 54 0 

ctcagganat naattgataa cctggctcat aacacattgt caagaatgtg gatttcccca 600 

ggatattatt atttgtttac cggggganag gataactgtt tcncntattt taattgaaca 660 

aactnaaaca aaanctaagg aaatcc 68 6 

<210> 37 

<211> 681 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 7, 10, 11, 19, 25, 32, 46, 53, 77, 93, 101, 103, 109, 115, 
123, 128, 139, 157, 175, 180, 192, 193, 194, 212, 218, 226, 
227, 233, 240, 241, 259, 260, 267, 289, 296, 297, 298, 312, 
313, 314, 320, 325, 330, 337, 345, 346, 352, 353, 356 
<223> n = A,T,C or G 

<221> misc_feature 

<222> 382, 385, 400, 427, 481, 484, 485, 491,. 505, 515, 533, 542, 
544, 554, 557, 560, 561, 564, 575, 583, 589, 595, 607, 619, 
628, 634, 641, 645, 658, 670 
<223> n = A,T,C or G 



<400> 37 

gagacanacn naacgtcang agaanaaaag 
caccttccca ccagcancca gcgcccccca 
cancctgnat caatctganc tctattcctg 
aaaggtcgca cnnncagaga agctgctgcc 
nataggaaac tggtgaccnn gctgcanaat 
cacactgagt tnnngatgan gcctnaccan 
tgcggaggaa ggaagacccc gnacnggatc 
gattatnccc cttgactgag tctctgaggg 
natnntgctc natcgggact gacangctgg 
tnanaccaac agcnacngan natnggggct 
cggcgcnggc cttcggtgnt gtcctccntc 
ggactcctcn ttgttccctc c 

<210> 38 

<211> 687 

<212> DNA 

<213> Homo sapiens 



angcatggaa cacaanccag gcncgatggc 60 
gcngccccca ngnccggang accangactc 120 
gcccatncct acctcggagg tggangccgn 180 
ancaccancc gccccnnccc tgncgggctn 24 0 
tcatacagga gcacgcgang ggcacnnnct 300 
ggacctnccc cagcnnattg annacnggac 360 
ctggccggcn tgccaccccc ccacccctag 42 0 
gctacccgaa cccgcctcca ttccctacca 480 
ggatnggagg ggctatcccc cancatcccc 54 0 
ccccngggtc ggngcaacnc tcctncaccc 600 
aacnaattcc naaanggcgg gccccccngt 660 
681 



<220> 

<221> misc_f eature 

<222> 3, 30, 132, 151, 203, 226, 228, 233, 252, 264, 279, 306, 
308, 320, 340, 347, 380, 407, 429, 437, 440, 445, 448, 491, 
559, 567, 586, 589, 593, 596, 603, 605, 606, 609, 626, 639, 
655, 674, 682 
<223> n = A,T,C or G 



<400> 38 

canaaaaaaa aaaacatggc cgaaaccagn 
ctcccggcct gtgtccggaa ggtttccctc 
gagggcggga cntgccgggg ccggagctca 
atcgcaaggg cggcgctaac ctnaggcctc 
gggggctgtg anaaccgcaa aaanaacgct 



aagctgcgcg atggcgccac ggcccctctt 60 
cgaggcgccc cggctcccgc aagcggagga 12 0 
naggccctgg ggccgctctg ctctcccgcc 18 0 
cccgcaaagg tccccnangc ggnggcggcg 24 0 
gggcgcgcng cgaacccgtc cacccccgcg 300 
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aaggananac ttccacagan gcagcgtttc cacagcccan agccacnttt ctagggtgat 360 
gcaccccagt aagttcctgn cggggaagct caccgctgtc aaaaaanctc ttcgctccac 42 0 
cggcgcacna aggggangan ggcangangc tgccgcccgc acaggtcatc tgatcacgtc 48 0 
gcccgcccta ntctgctttt gtgaatctcc actttgttca accccacccg ccgttctctc 54 0 
ctccttgcgc cttcctctna ccttaanaac cagcttcctc tacccnatng tanttnctct 600 
gcncnngtng aaattaattc ggtccnccgg aacctcttnc ctgtggcaac tgctnaaaga 660 
aactgctgtt ctgnttactg cngtccc 687 

<210> 39 

<211> 695 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 300, 401, 423, 429, 431, 437, 443, 448, 454, 466, 492, 515, 
523, 524, 536, 538, 541, 552, 561, 566, 581, 583, 619, 635, 
636, 641, 649, 661, 694 
<223> n = A,T,C or G 

<400> 39 

actagtctgg cctacaatag tgtgattcat gtaggacttc tttcatcaat tcaaaacccc 60 
tagaaaaacg tatacagatt atataagtag ggataagatt tctaacattt ctgggctctc 12 0 
tgacccctgc gctagactgt ggaaagggag tattattata gtatacaaca ctgctgttgc 180 
cttattagt't ataacatgat aggtgctgaa ttgtgattca caatttaaaa acactgtaat 240 
ccaaactttt ttttttaact gtagatcatg catgtgaatg ttaatgttaa tttgttcaan 300 
gttgttatgg gtagaaaaaa ccacatgcct taaaatttta aaaagcaggg cccaaactta 360 
ttagtttaaa attaggggta tgtttccagt ttgttattaa ntggttatag ctctgtttag 420 
aanaaatcna ngaacangat ttngaaantt aagntgacat tatttnccag tgacttgtta 480 
atttgaaatc anacacggca ccttccgttt tggtnctatt ggnntttgaa tccaancngg 54 0 
ntccaaatct tnttggaaac ngtccnttta acttttttac nanatcttat ttttttattt 600 
tggaatggcc ctatttaang ttaaaagggg ggggnnccac naccattcnt gaataaaact 660 
naatatatat ccttggtccc ccaaaattta aggng 695 



<210> 40 

<211> 674 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 403, 428, 432, 507, 530, 543, 580, 583, 591, 604, 608, 621, 
624, 626, 639, 672 
<223> n = A,T,C or G 



<400> 40 

actagtagtc agttgggagt ggttgctata ccttgacttc atttatatga atttccactt 60 

tattaaataa tagaaaagaa aatcccggtg cttgcagtag agttatagga cattctatgc 120 

ttacagaaaa tatagccatg attgaaatca aatagtaaag gctgttctgg ctttttatct 18 0 

tcttagctca tcttaaataa gtagtacact tgggatgcag tgcgtctgaa gtgctaatca 24 0 

gttgtaacaa tagcacaaat cgaacttagg atgtgtttct tctcttctgt gtttcgattt 300 

tgatcaattc tttaattttg ggaacctata atacagtttt cctattcttg gagataaaaa 360 

ttaaatggat cactgatatt taagtcattc tgcttctcat ctnaatattc catattctgt 42 0 

attagganaa antacctccc agcacagccc cctctcaaac cccacccaaa accaagcatt 48 0 

tggaatgagt ctcctttatt tccgaantgt ggatggtata acccatatcn ctccaatttc 54 0 

tgnttgggtt gggtattaat ttgaactgtg catgaaaagn ggnaatcttt nctttgggtc 600 

aaantttncc ggttaatttg nctngncaaa tccaatttnc tttaagggtg tctttataaa 660 

atttgctatt cngg 67 4 



WO 02/47534 



18 



PCT7US01/47576 



<210> 41 
<211> 657 
<212> DNA 
<213> Homo sapie: 



<220> 

<221> misc_feature 

<222> 243, 247, 251, 261, 267, 272, 298, 312, 315, 421, 432, 434, 
501, 524, 569, 594, 607, 650 
<223> n = A,T,C or G 



<400> 41 

gaaacatgca agtaccacac actgtttgaa ttttgcacaa aaagtgactg tagggatcag 60 
gtgatagccc cggaatgtac agtgtcttgg tgcaccaaga tgccttctaa aggctgacat 12 0 
accttgggac cctaatgggg cagagagtat agccctagcc cagtggtgac atgaccactc 180 
cctttgggag gctgaagtta aagggaatgg tatgtgtttt ctcatggaag cagcacatga 240 
atnggtnaca ngatgttaaa ntaaggntct antttgggtg tcttgtcatt tgaaaaantg 300 
acacactcct ancanctggt aaaggggtgc tggaagccat ggaagaactc taaaaacatt 360 
agcatgggct gatctgatta cttcctggca tcccgctcac ttttatggga agtcttatta 42 0 
naaggatggg ananttttcc atatccttgc tgttggaact ctggaacact ctctaaattt 480 
ccctctatta aaaatcactg nccttactac acttcctcct tganggaata gaaatggacc 540 
tttctctgac ttagttcttg gcatggganc cagcccaaat taaaatctga cttntccggt 600 
ttctccngaa ctcacctact tgaattggta aaacctcctt tggaattagn aaaaacc 657 

<210> 42 

<211> 389 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 179, 317, 320 
<223> n = A,T,C or G 



<400> 42 






actagtgctg 


aggaatgtaa 


acaagtttgc 


cgatagctca 


cactcctgca 


ctgtgcctgt 


caggaagaaa 


acaaaaacca 


gactgtgtcc 


ggccttcacc 


gccaccaggg 


tgtcccgcca 


atcctgaaga 


attcctgttt 


gggggttgtg 


tgttgcctgc 


ccgcgtngtn 


gggaagggac 


atattttaag 


ttaagaaaaa 


aaaaaaaaa 


<210> 43 






<211> 279 






<212> DNA 






<213> Homo 


sapiens 





tgggccttgc gagacttcac caggttgttt 60 

cacccaggaa tgtctttttt aattagaaga 12 0 

cacaatcaga aacctccgtt gtggcagang 18 0 

gacagggaga gactccagcc ttctgaggcc 24 0 

aaggaaaatc acccggattt aaaaagatgc 300 

tggtttcctg gtgaatttct taaaagaaaa 360 
389 



<400> 43 

actagtgaca agctcctggt cttgagatgt cttctcgtta aggagatggg ccttttggag 60 
gtaaaggata aaatgaatga gttctgtcat gattcactat tctagaactt gcatgacctt 120 
tactgtgtta gctctttgaa tgttcttgaa attttagact ttctttgtaa acaaataata 180 
tgtccttatc attgtataaa agctgttatg tgcaacagtg tggagatcct tgtctgattt 24 0 
aataaaatac ttaaacactg aaaaaaaaaa aaaaaaaaa 27 9 



<210> 44 
<211> 449 
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<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 245, 256, 264, 266, 273, 281, 323, 325, 337, 393 
<223> n = A,T,C or G 

<400> 44 

actagtagca tcttttctac aacgttaaaa ttgcagaagt agcttatcat taaaaaacaa 60 

caacaacaac aataacaata aatcctaagt gtaaatcagt tattctaccc cctaccaagg 12 0 

atatcagcct gttttttccc ttttttctcc tgggaataat tgtgggcttc ttcccaaatt 180 

tctacagcct ctttcctctt ctcatgcttg agcttccctg tttgcacgca tgcgttgtgc 240 

aagantgggc tgtttngctt ggantncggt ccnagtggaa ncatgctttc ccttgttact 300 

gttggaagaa actcaaacct tcnancccta ggtgttncca ttttgtcaag tcatcactgt 360 

atttttgtac tggcattaac aaaaaaagaa atnaaatatt gttccattaa actttaataa 42 0 

aactttaaaa gggaaaaaaa aaaaaaaaa 449 

<210> 45 

<211> 559 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 263 

<223> n = A,T,C or G 
<400> 45 

actagtgtgg gggaatcacg gacacttaaa gtcaatctgc gaaataattc ttttattaca 60 
cactcactga agtttttgag tcccagagag ccattctatg tcaaacattc caagtactct 120 
ttgagagccc agcattacat caacatgccc gtgcagttca aaccgaagtc cgcaggcaaa 18 0 
tttgaagctt tgcttgtcat tcaaacagat gaaggcaaga gtattgctat tcgactaatt 24 0 
ggtgaagctc ttggaaaaaa ttnactagaa tactttttgt gttaagttaa ttacataagt 300 
tgtattttgt taactttatc tttctacact acaattatgc ttttgtatat atattttgta 360 
tgatggatat ctataattgt agattttgtt tttacaagct aatactgaag actcgactga 420 
aatattatgt atctagccca tagtattgta cttaactttt acagggtgaa aaaaaaattc 480 
tgtgtttgca ttgattatga tattctgaat aaatatggga atatatttta atgtgggtaa 540 
aaaaaaaaaa aaaaaggaa 559 

<210> 46 

<211> 731 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 270, 467, 477, 502, 635, 660, 671, 688, 695, 697, 725 
<223> n = A,T,C or G 

<400> 46 

actagttcta gtaccatggc tgtcatagat gcaaccatta tattccattt 
tcaggttccc taacaattgt ttgaaactga atatatatgt ttatgtatgt 
actgtcatgt atatggtgta tatgggatgt gtgcagtttt cagttatata 
tatacatatg catatatatg tataatatac atatatacat gcatacactt 
catatatata cacatatatg cacacatatn atcactgagt tccaaagtga 
ggggcaattg tattctctcc ctctgtctgc tcactgggcc tttgcaagac 
cttgatttcc tttggataag agtcttatct tcggcactct tgactctagc 



agtttcttcc 60 
gtgtgtgttc 120 
tatattcata 180 
gtataatata 2 40 
gtctttattt 300 
atagcaattg 360 
cttaacttta 42 0 
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gatttctatt ccagaatacc tctcatatct atcttaaaac ctaaganggg taaagangtc 48 0 

ataagattgt agtatgaaag antttgctta gttaaattat atctcaggaa actcattcat 540 

ctacaaatta aattgtaaaa tgatggtttg ttgtatctga aaaaatgttt agaacaagaa 600 

atgtaactgg gtacctgtta tatcaaagaa cctcnattta ttaagtctcc tcatagccan 660 

atccttatat ngccctctct gacctgantt aatananact tgaataatga atagttaatt 72 0 

taggnttggg c 731 

<210> 47 
<211> 640 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 5, 28, 106, 153, 158, 173, 176, 182, 189, 205, 210, 214, 
225, 226, 229, 237, 260, 263, 269, 277, 281, 282, 322, 337, 
338, 354, 365, 428, 441, 443, 456, 467, 476, 484, 503, 508, 
554, 567, 575, 579, 588, 601, 606, 609, 611, 621, 636 
<223> n = A,T,C or G 



<400> 47 

tgcgngccgg tttggccctt ctttgtanga 
cgttaataac tcctcaggtc cctgcctgca 
gtacaccaaa tgtgacatcc tttcaccaat 
anacgactnc aacaattttt tgatnacccn 
ggagcagcat ggacctgtcn gcnactaang 
ttggtatgtc ttactgaaag anagaaacat 
caganattgc caatgccaag tccgagcggt 
tacatacntt gtccccgaaa nanaagatgc 
acanctacac ctggtgcttg ganaacanac 
cccagtgggt tttnccttgg cacctanctt 
ntggcnttnt nttgggacca ntcttctcac 

<210> 48 

<211> 257 

<212> DNA 

<213> Homo sapiens 



cactttcatc cgccctgaaa tcttcccgat 60 
cagggttttt tcttantttg ttgcctaaca 12 0 
atngattnct tcataccaca tcntcnatgg 180 
aaanactggg ggctnnaana agtacantct 240 
gaacaanagt nntgaacatt tacacaacct 300 
gcttctnncc ctagaccacg aggncaaccg 360 
tagatcaggt aatacattcc atggatgcat 42 0 
cctaanggct tcttcanact ggtccngaaa 480 
tctttggaag atcatctggc acaagttccc 540 
accanatcna ttcggaancc attctttgcc 600 
aactgnaccc 64 0 



<400> 48 

actagtatat gaaaatgtaa atatcacttg 
ccaccttgag cagccttgga aacctaacct 
tgattttctt tgttcctgaa aaagtgattt 
ttatatttgt atatgtatca tcataaaata 
aaaaaaaaaa aaaaaaa 



tgtactcaaa caaaagttgg tcttaagctt 60 
gcctctttta gcataatcac attttctaaa 120 
gtattagttt tacatttgtt ttttggaaga 180 
tttaaataaa aagtatcttt agagtgaaaa 240 
257 



<210> 49 
<211> 652 
<212> DNA 
<213> Homo 



sapiens 



<220> 

<221> misc_feature 

<222> 410, 428, 496, 571, 647 

<223> n = A,T,C or G 



<400> 49 

actagttcag atgagtggct gctgaagggg cccccttgtc attttcatta taacccaatt 60 
tccacttatt tgaactctta agtcataaat gtataatgac ttatgaatta gcacagttaa 12 0 
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gttgacacta gaaactgccc atttctgtat tacactatca aataggaaac attggaaaga 18 0 

tggggaaaaa aatcttattt taaaatggct tagaaagttt tcagattact ttgaaaattc 240 

taaacttctt tctgtttcca aaacttgaaa atatgtagat ggactcatgc attaagactg 300 

ttttcaaagc tttcctcaca tttttaaagt gtgattttcc ttttaatata catatttatt 3 60 

ttctttaaag cagctatatc ccaacccatg actttggaga tatacctatn aaaccaatat 42 0 

aacagcangg ttattgaagc agctttctca aatgttgctt cagatgtgca agttgcaaat 480 

tttattgtat ttgtanaata caatttttgt tttaaactgt atttcaatct atttctccaa 54 0 

gatgcttttc atatagagtg aaatatccca ngataactgc ttctgtgtcg tcgcatttga 600 

cgcataactg cacaaatgaa cagtgtatac ctcttggttg tgcattnacc cc 652 

<210> 50 
<211> 650 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 237, 270, 311, 443, 454, 488, 520, 535, 539, 556, 567, 594, 
603, 634 

<223> n = A,T,C or G 
<400> 50 

ttgcgctttg atttttttag ggcttgtgcc ctgtttcact tatagggtct agaatgcttg 60 
tgttgagtaa aaaggagatg cccaatattc aaagctgcta aatgttctct ttgccataaa 120 
gactccgtgt aactgtgtga acacttggga tttttctcct ctgtcccgag gtcgtcgtct 180 
gctttctttt ttgggttctt tctagaagat tgagaaatgc atatgacagg ctgagancac 240 
ctccccaaac acacaagctc tcagccacan gcagcttctc cacagcccca gcttcgcaca 300 
ggctcctgga nggctgcctg ggggaggcag acatgggagt gccaaggtgg ccagatggtt 360 
ccaggactac aatgtcttta tttttaactg tttgccactg ctgccctcac ccctgcccgg 420 
ctctggagta ccgtctgccc canacaagtg ggantgaaat gggggtgggg gggaacactg 480 
attcccantt agggggtgcc taactgaaca gtagggatan aaggtgtgaa cctgngaant 540 
gcttttataa attatnttcc ttgttanatt tattttttaa tttaatctct gttnaactgc 600 
ccngggaaaa ggggaaaaaa aaaaaaaaat tctntttaaa cacatgaaca 650 

<210> 51 

<211> 545 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 66, 159, 195, 205, 214, 243, 278, 298, 306, 337, 366, 375, 
382, 405, 446, 477, 492, 495, 503, 507, 508, 521, 537 
<223> n = A,T,C or G 

<400> 51 

tggcgtgcaa ccagggtagc tgaagtttgg gtctgggact ggagattggc cattaggcct 60 
cctganattc cagctccctt ccaccaagcc cagtcttgct acgtggcaca gggcaaacct 120 
gactcccttt gggcctcagt ttcccctccc cttcatgana tgaaaagaat actacttttt 180 
cttgttggtc taacnttgct ggacncaaag tgtngtcatt attgttgtat tgggtgatgt 240 
gtncaaaact gcagaagctc actgcctatg agaggaanta agagagatag tggatganag 300 
ggacanaagg agtcattatt tggtatagat ccacccntcc caacctttct ctcctcagtc 360 
cctgcncctc atgtntctgg tntggtgagt cctttgtgcc accanccatc atgctttgca 42 0 
ttgctgccat cctgggaagg gggtgnatcg tctcacaact tgttgtcatc gtttganatg 480 
catgctttct tnatnaaaca aanaaannaa tgtttgacag ngtttaaaat aaaaaanaaa 54 0 
caaaa • 545 



<210> 52 
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<211> 678 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<222> 98, 119, 121, 131, 136, 
176, 184, 189, 190, 191, 200, 
230, 237, 240, 241, 255, 264, 
291, 297, 301, 306, 308, 314, 
<223> n = A,T,C or G 



139, 140, 142, 143, 163, 168, 172, 

201, 205, 207, 221, 223, 229, 

266, 267, 276, 280, 288, 289, 

315, 326, 332, 335, 337 



<221> misc_feature 

<222> 339, 341, 343, 344, 345, 347, 350, 355, 356, 358, 362, 363, . 
372, 379, 395, 397, 398, 400, 403, 412, 414, 421, 423, 431, 
435, 438, 439, 450, 457, 463, 467, 471, 474, 480, 483, 484, 
487, 490, 491, 492, 493, 499, 500, 504, 508, 518, 536 
<223> n = A,T,C or G 

<221> misc_feature 

<222> 538, 549, 551, 552, 554, 556, 557, 562, 563, 567, 571, 572, 
576, 579, 590, 592, 595, 598, 606, 609, 613, 620, 622, 624, 
626, 631, 634, 638, 641, 647, 654, 660, 661, 674 
<223> n = A,T,C or G 

<400> 52 

actagtagaa gaactttgcc gcttttgtgc ctctcacagg cgcctaaagt cattgccatg 60 

ggaggaagac gatttggggg gggagggggg gggggcangg tccgtggggc tttccctant 120 

ntatctccat ntccantgnn cnntgtcgcc tcttccctcg tcncattnga anttantccc 180 

tggnccccnn nccctctccn ncctncncct cccccctccg ncncctccnn ctttttntan 240 

ncttccccat ctccntcccc cctnanngtc ccaacnccgn cagcaatnnc ncacttnctc 300 

nctccncncc tccnnccgtt cttctnttct cnacntntnc ncnnntnccn tgccnntnaa 360 

annctctccc cnctgcaan'c gattctctcc ctccncnnan ctntccactc cntncttctc 42 0 

ncncgctcct nttcntcnnc ccacctctcn ccttcgnccc cantacnctc nccncccttn 48 0 

cgnntcnttn nnntcctcnn accncccncc tcccttcncc cctcttctcc ccggtntntc 540 

tctctcccnc nncncnncct cnncccntcc nngcgnccnt ttccgccccn cnccnccntt 600 

ccttcntcnc cantccatcn cntntnccat nctncctncc nctcacnccc gctncccccn 660 

ntctctttca cacngtcc 678 



<210> 53 
<211> 502 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<222> 139, 146, 215, 217, 257, 263, 289, 386, 420, 452, 457, 461, 

466, 482, 486 

<223> n = A,T,C or G 



<400> 53 

tgaagatcct ggtgtcgcca tgggccgccg 

caagccgtac ccaaagtctc gcttctgccg 

tgacctgggg cggaaaaang caaaantgga 

agatcaatat gagcagctgt cctctgaagc 

gtacatggta aaaagtngtg gcnaagatgc 

cacgtcatcc gcatcaacaa gatgttgtcc 

atgcgaagtg cctttggaaa acccanggca 



ccccgcccgt tgttaccggt attgtaagaa 60 
aggtgtccct gatgccaaaa ttcgcatttt 120 
tgagtctccg ctttgtggcc acatggtgtc 180 
cctgnangct gcccgaattt gtgccaataa 240 
ttccatatcc gggtgcggnt ccaccccttc 300 
tgtgctgggg ctgacaggct cccaacaggc 360 
ctgtggccag ggttcacatt gggccaattn 42 0 
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atcatgttca tccgcaccaa ctgcagaaca angaacntgt naattnaagc cctgcccagg 480 
gncaanttca aatttcccgg cc 502 

<210> 54 

<211> 494 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<222> 431, 442, 445 
<223> n = A,T,C or G 

<400> 54 

actagtccaa gaaaaatatg 
tttaatgcca aaagtttgct 
gtttgcctta atgaatactg 
caagaaattt ctacatctta 
attatgagga ctttaatctt 
atgatttcta agtatatttt 
tgttaaattt ttctttcagt 
ctgttttgaa ngggatatga 
aaaaaaaaaa aaaa 

<210> 55 

<211> 606 

<212> DNA 

<213> Homo sapiens 



cttaatgtat attacaaagg 
ttgtccacaa tttccttaag 
ttgggaaaaa acacagtata 
gcgactccaa gaagaatgag 
tccttaaaca caataatgtt 
tcatgcagga cagtttttca 
ggcaacctct ataatcttta 
cnatnaatct atcagatggg 



ctttgtatat gttaacctgt 60 
acctcttcag aaagggattt 120 
atgagtgaaa agggcagaag 180 
tatccacatt tagatggcac 240 
ttcttttttc ttttattcac 300 
accttgatgt acagtgactg 360 
aaatatggtg agcatcttgt 420 
aaatcctgtt tccaagttag 480 
494 



<220> 

<221> misc_feature 

<222> 375, 395, 511, 542, 559, 569, 578, 581 
<223> n = A,T,C or G 



<400> 55 

actagtaaaa agcagcattg ccaaataatc cctaattttc cactaaaaat ataatgaaat 60 
gatgttaagc tttttgaaaa gtttaggtta aacctactgt tgttagatta atgtatttgt 12 0 
tgcttccctt tatctggaat gtggcattag cttttttatt ttaaccctct ttaattctta 180 
ttcaattcca tgacttaagg ttggagagct aaacactggg atttttggat aacagactga 240 
cagttttgca taattataat cggcattgta catagaaagg atatggctac cttttgttaa 300 
atctgcactt tctaaatatc aaaaaaggga aatgaagtat aaatcaattt ttgtataatc 360 
tgtttgaaac atgantttta tttgcttaat attanggctt tgcccttttc tgttagtctc 420 
ttgggatcct gtgtaaaact gttctcatta aacaccaaac agttaagtcc attctctggt 480 
actagctaca aattccgttt catattctac ntaacaattt aaattaactg aaatatttct 540 
anatggtcta cttctgtcnt ataaaaacna aacttgantt nccaaaaaaa aaaaaaaaaa 600 
aaaaaa 606 



<210> 56 

<211> 183 

<212> DNA 

<213> Homo sapiens 



<400> 56 

actagtatat ttaaacttac 
aattaacatg gttataatac 
gtgtgataaa ctga.ttttgg 
aaa 



aggcttattt gtaatgtaaa 
gtacaatcct tccctcatcc 
tttgcaataa aaccttgaaa 



ccaccatttt aatgtactgt 60 
catcacacaa ctttttttgt 120 
aataaaaaaa aaaaaaaaaa 18 0 
183 
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<210> 57 
<211> 622 
<212> DNA 
<213> Homo sapiei 



<220> 

<221> misc_feature 

<222> 358, 368, 412, 414, 425, 430, 453, 455, 469, 475, 495, 499, 
529, 540, 564, 575, 590 
<223> n = A,T,C or G 



<400> 57 

actagtcact actgtcttct ccttgtagct 
gcagtggaga gtgctgctgg gtgtacgctg 
aatcagtgag cactgttctg ctcagagctc 
ctgggtcaaa gctgcatgaa accaggccct 
agagaacctg acttctcttt ccctctccct 
agggatcttc tgagcttgtt tccctgctgg 
tctacaanaa gcagcccttc tttgtcctct 
gaganaccan aagcctctga tttttaattt 
atatatattt ctttnaatnt ttgagtcttt 
gaaacctgaa ttaaaaccat gaanaaaaat 
aaacttgaaa aaaaaaaaaa aa 



aatcaatcaa tattcttccc ttgcctgtgg 60 
cacctgccca ctgagttggg gaaagaggat 12 0 
ctgatctacc ccacccccta ggatccagga 18 0 
ggcagcaacc tgggaatggc tggaggtggg 24 0 
cctccaacat tactggaact ctatcctgtt 300 
gtgggacaga agacaaagga gaagggangg 360 
ggggttaatg agcttgacct ananttcatg 420 
ccntnaaatg tttgaagtnt atatntacat 48 0 
gatatgtctt aaaatccant ccctctgccn 54 0 
gtttncctta aagatgttan taattaattg 600 
622 



<210> 58 

<211> 433 

<212> DNA 

<213> Homo sapiens 



<400> 58 

gaacaaattc tgattggtta tgtaccgtca 
gtgtggaagc gttgaaaatt gaaagttact 
tcct'ttcagc tgccagtgtt gaataatgta 
accagcttta agctgaacca ttttatgaat 
catatttgtg actttaatcg tgctgcttgg 
tgacagtaaa cctgtccatt atgaatggcc 
ttatccacca aagacttcat ttgtgtatca 
aaaaaaaaaa aaa 



aaagacttga agaaatttca tgattttgca 60 

gcttttccac ttgctcatat agtaaaggga 120 

tcatccagag tgatgttatc tgtgacagtc 180 

accaaataaa tagacctctt gtactgaaaa 240 

atagaaatat ttttactggt tcttctgaat 300 

tactgttcta ttatttgttt tgacttgaat 360 

tcaataaagt tgtatgtttc aactgaaaaa 420 
433 



<210> 59 

<211> 649 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 22, 190, 217, 430, 433, 484, 544, 550, 577, 583, 594 
<223> n = A,T,C or G 



<400> 59 

actagttatt atctgacttt cnggttataa 
tgtcatttgg atttgcattt ctctgatgag 
ttggccatat gtgtatgttc cctggagaag 
attaggcgtn tgtcttttta ttactgagtt 
gacccttatc agatacatgg tttgcaaata 
ctttatcgat aatgtcctta gacatataat 
ggctgtgcaa ggtgggctca cgcttgtaat 
atcatatgan gangctagga gttcgaggtc 



tcattctaat gagtgtgaag tagcctctgg 60 
tgatgctatc aagcaccttt gctggtgctg 12 0 
tgtctgtgct gagccttggc ccacttttta 18 0 
gtaaganttc tttatatatt ctggattcta 24 0 
ttttctccca ttctgtgggt tgtgttttca 30 0 
aaatttgtat tttaaaagtg acttgatttg 360 
cccagcactt tgggagactg aggtgggtgg 420 
agcctggcca gcatagcgaa aacttgtctc 48 0 
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tacnaaaaat acaaaaatta gtcaggcatg gtggtgcacg tctgtaatac cagcttctca 54 0 
ggangctgan gcacaaggat cacttgaacc ccagaangaa gangttgcag tganctgaag 600 
atcatgccag ggcaacaaaa atgagaactt gtttaaaaaa aaaaaaaaa 64 9 

<210> 60 

<211> 423 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 209, 222, 277, 389, 398 

<223> n = A,T,C or G 



<400> 60 

actagttcag gccttccagt tcactgacaa acatggggaa gtgtgcccag ctggctggaa 60 

acctggcagt gataccatca agcctgatgt ccaaaagagc aaagaatatt tctccaagca 12 0 

gaagtgagcg ctgggctgtt ttagtgccag gctgcggtgg gcagccatga gaacaaaacc 180 

tcttctgtat tttttttttc cattagtana acacaagact cngattcagc cgaattgtgg 240 

tgtcttacaa ggcagggctt tcctacaggg ggtgganaaa acagcctttc ttcctttggt 300 

aggaatggcc tgagttggcg ttgtgggcag gctactggtt tgtatgatgt attagtagag 3 60 

caacccatta atcttttgta gtttgtatna aacttganct gagaccttaa acaaaaaaaa 420 
aaa 423 

<210> 61 

<211> 423 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 195, 285, 295, 329, 335, 340, 347, 367, 382, 383, 391, 396, 
418 

<223> n = A,T,C or G 



<400> 61 

cgggactgga atgtaaagtg 
tccctcccca gaccccagag 
caggtctgag tatggctggg 
actggatcag ggtanctaca 
atttggtgtt ggggtgcggg 
caacctccct tggggcaatt 
ttaaggnctt taaaaatgtt 



aagttcggag ctctgagcac 
ggagaggccc accccgccca 
agtcgggggc cacaggcctc 
agtggccggg ccttgccttt 
gtccctggcc cccttttcca 
gggcctggnt ctccncccgn 
annttttccc ntgccngggt 



gggctcttcc cgccgggtcc 60 
gccccgcccc agcccctgct 12 0 
tagctgtgct gctcaagaag 180 
gggattctac cctgttccta 240 
cactncctcc ctccngacag 300 
tgttgcnacc ctttgttggt 3 60 
taaaaaagga aaaaactnaa 42 0 
423 



<210> 62 

<211> 683 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 218, 291, 305, 411, 416, 441, 443, 453, 522, 523, 536, 542, 
547, 566, 588, 592, 595, 603, 621, 628, 630, 632, 644, 645, 
648, 655, 660, 672, 674, 676, 677, 683 
<223> n = A,T,C or G 



<400> 62 
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gctggagagg 


ggtacggact 


ttcttggagt 


tgtcccaggt 


tggaatgaga 


ctgaactcaa 


60 


gaagagaccc 


taagagactg 


gggaatggtt 


cctgccttca 


ggaaagtgaa 


agacgcttag 


120 


gctgtcaaca 


cttaaaggaa 


gtccccttga 


agcccagagt 


ggacagacta 


gacccattga 


180 


tggggccact 


ggccatggtc 


cgtggacaag 


acattccngt 


gggccatggc 


acaccggggg 


240 


ggatcaaaat 


gtgtacttgt 


ggggtctcgc 


cccttgccaa 


aaccaaacca 


ntcccactcc 


300 


tgtcnttgga 


ctttcttccc 


attccctcct 


ccccaaatgc 


acttcccctc 


ctccctctgc 




ccctcctgtg 


tttttggaat 


tctgtttccc 


tcaaaattgt 


taatttttta 


nttttngacc 


420 


atgaacttat 


gtttggggtc 


nangttcccc 


ttnccaatgc 


atactaatat 


attaatggtt 


480 


atttattttt 


gaaatatttt 


ttaatgaact 


tggaaaaaat 


tnntggaatt 


tccttncttc 


540 


cnttttnttt 


ggggggggtg 


gggggntggg 


ttaaaatttt 


tttggaancc 


cnatnggaaa 


600 


ttnttacttg 


gggcccccct 


naaaaaantn 


anttccaatt 


cttnnatngc 


ccctnttccn 


660 


ctaaaaaaaa 




aan 








683 


<210> 63 














<211> 731 














<212> DNA 














<213> Homo 


sapiens 












<220> 














<221> misc 


feature 












<222> 237," 


249, 263, 288, 312, 317, 323, 326, 337, 352, 


. 362, 370, 





377, 400, 411, 414, 434, 436, 446, 457, 473, 486, 497, 498, 

502, 512, 531, 546, 554, 563, 565, 566, 588, 597, 608, 611, 

613, 615, 627, 632, 640, 641, 644, 654, 660, 663, 665 
<223> n = A,T,C or G 

<221> misc_feature 

<222> 671, 678, 692, 697, 698, 699, 704, 705, 712, 714, 717, 718, 
719, 723, 725, 730, 731 
<223> n = A,T,C or G 

<400> 63 

actagtcata aagggtgtgc gcgtcttcga cgtggcggtc ttggcgccac tgctgcgaga 60 
cccggccctg gacctcaagg tcatccactt ggtgcgtgat ccccgcgcgg tggcgagttc 12 0 
acggatccgc tcgcgccacg gcctcatccg tgagagccta caggtggtgc gcagccgaga 180 
ccgcgagctc accgcatgcc cttcttggag gccgcgggcc acaagcttgg cgcccanaaa 240 
gaaggcgtng ggggcccgca aantaccacg ctctgggcgc tatggaangt cctcttgcaa 300 
taatattggt tnaaaanctg canaanagcc cctgcanccc cctgaactgg gntgcagggc 360 
cncttacctn gtttggntgc ggttacaaag aacctgtttn ggaaaaccct nccnaaaacc 420 
ttccgggaaa attntncaaa tttttnttgg ggaattnttg ggtaaacccc ccnaaaatgg 480 
gaaacntttt tgccctnnaa antaaaccat tnggttccgg gggccccccc ncaaaaccct 540 
tttttntttt tttntgcccc cantnncccc ccggggcccc tttttttngg ggaaaanccc 600 
cccccctncc nanantttta aaagggnggg anaatttttn nttncccccc gggncccccn 660 
ggngntaaaa nggtttcncc cccccgaggg gnggggnnnc ctcnnaaacc cntntcnnna 72 0 
ccncnttttn n 731 

<210> 64 

<211> 313 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc__feature 
<222> 240 

<223> n = A,T,C or G 
<400> 64 

actagttgtg caaaccacga ctgaagaaag acgaaaagtg ggaaataact tgcaacgtct 60 
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gttagagatg gttgctacac atgttgggtc tgtagagaaa catcttgagg agcagattgc 12 0 
taaagttgat agagaatatg aagaatgcat gtcagaagat ctctcggaaa atattaaaga 18 0 
gattagagat aagtatgaga agaaagctac tctaattaag tcttctgaag aatgaagatn 240 
aaatgttgat catgtatata tatccatagt gaataaaatt gtctcagtaa agttgtaaaa 300 
. aaa 313 



<210> 65 
<211> 420 
<212> DNA 
<213> Homo 



sapiens 



<220> 

<221> misc_feature 

<222> 400, 402, 403, 404, 405, 406, 409, 411, 412, 414, 415, 416 
<223> n = A,T,C or G 



<400> 65 

actagttccc tggcaggcaa gggcttccaa 
caggaagctg gcagtggcag cttctgtgtc 
tctgggaggt tggagggaag aatctaggcc 
gtagatactg ccttaacact ccctcctctc 
ctccgtgctc actaatttat ttccaggaaa 
atttgtttta acattttcat tgcaagtatt 
acacaaatta atgatattaa aaagcatcca 



ctgaggcagt gcatgtgtgg cagagagagg 60 
tagggagggg tgtggctccc tccttccctg 12 0 
ttagcttgcc ctcctgccac ccttcccctt 18 0 
tcagctgtgg ctgccaccca agccaggttt 24 0 
ggtgtgtgga agacatgagc cgtgtataat 300 
gaccatcatc cttggttgtg tatcgttgta 360 
aacaaagccn annnnnaana nnannngaaa 420 



<210> 66 

<211> 676 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 328, 454, 505, 555, 586, 612, 636, 641 
<223> n = A,T,C or G 

<400> 66 

actagtttcc tatgatcatt aaactcattc tcagggttaa gaaaggaatg taaatttctg 60 
cctcaatttg tacttcatca ataagttttt gaagagtgca gatttttagt caggtcttaa 120 
aaataaactc acaaatctgg atgcatttct aaattctgca aatgtttcct ggggtgactt 180 
aacaaggaat aatcccacaa tatacctagc tacctaatac atggagctgg ggctcaaccc 240 
actgttttta aggatttgcg cttacttgtg gctgaggaaa aataagtagt tccgagggaa 300 
gtagttttta aatgtgagct tatagatngg aaacagaata tcaacttaat tatggaaatt 360 
gttagaaacc tgttctcttg ttatctgaat cttgattgca attactattg tactggatag 42 0 
actccagccc attgcaaagt ctcagatatc ttanctgtgt agttgaattc cttggaaatt 480 
ctttttaaga aaaaattgga gtttnaaaga aataaacccc tttgttaaat gaagcttggc 54 0 
tttttggtga aaaanaatca tcccgcaggg cttattgttt aaaaanggaa ttttaagcct 600 
ccctggaaaa anttgttaat taaatgggga aaatgntggg naaaaattat ccgttagggt 660 
ttaaagggaa aactta 67 6 



<210> 67 

<211> 620 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 419, 493, 519, 568, 605, 610 
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<223> n = A,T,C or G 
<400> 67 

caccattaaa gctgcttacc aagaacttcc ccagcatttt gacttccttg tttgatagct 60 
gaattgtgag caggtgatag aagagccttt ctagttgaac atacagataa tttgctgaat 12 0 
acattccatt taatgaaggg gttacatctg ttacgaagct actaagaagg agcaagagca 180 
taggggaaaa aaatctgatc agaacgcatc aaactcacat gtgccccctc tactacaaac 24 0 
agattgtagt gctgtggtgg tttattccgt tgtgcagaac ttgcaagctg agtcactaaa 300 
cccaaagaga ggaaattata ggttagttaa acattgtaat cccaggaact aagtttaatt 360 
cacttttgaa gtgttttgtt ttttattttt ggtttgtctg atttactttg ggggaaaang 42 0 
ctaaaaaaaa agggatatca atctctaatt cagtgcccac taaaagttgt ccctaaaaag 48 0 
tctttactgg aanttatggg actttttaag ctccaggtnt tttggtcctc caaattaacc 54 0 
ttgcatgggc cccttaaaat tgttgaangg cattcctgcc tctaagtttg gggaaaattc 600 
ccccnttttn aaaatttgga 62 0 

<210> 68 

<211> 551 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature o 

<222> 286, 464, 480, 501, 502, 518, 528, 533, 536, 537, 538, 539, 
540, 541, 543, 544, 545, 547, 548, 549 
<223> n = A,T,C or G 

<400> 68 

actagtagct ggtacataat cactgaggag ctatttctta acatgctttt atagaccatg 60 

ctaatgctag accagtattt aagggctaat ctcacacctc cttagctgta agagtctggc 120 

ttagaacaga cctctctgtg caataacttg tggccactgg aaatccctgg gccggcattt 180 

gtattggggt tgcaatgact cccaagggcc aaaagagtta aaggcacgac tgggatttct 240 

tctgagactg tggtgaaact ccttccaagg ctgagggggt cagtangtgc tctgggaggg 300 

actcggcacc actttgatat tcaacaagcc acttgaagcc caattataaa attgttattt 360 

tacagctgat ggaactcaat ttgaaccttc aaaactttgt tagtttatcc tattatattg 42 0 

ttaaacctaa ttacatttgt ctagcattgg atttggttcc tgtngcatat gtttttttcn 48 0 

cctatgtgct cccctccccc rmatcttaat ttaaaccnca attttgcnat tcnccnnnnn 54 0 

nannnannna a 551 

<210> 69 

<211> 396 

<212> DNA 

<213> Homo sapiens 

<220> • 

<221> misc_feature 
<222> 235, 310, 323, 381 
<223> n = A,T,C or G 

<400> 69 

cagaaatgga aagcagagtt ttcatttctg tttataaacg tctccaaaca aaaatggaaa 60 

gcagagtttt cattaaatcc ttttaccttt tttttttctt ggtaatcccc tcaaataaca 12 0 

gtatgtggga tattgaatgt taaagggata tttttttcta ttatttttat aattgtacaa 180 

aattaagcaa atgttaaaag ttttatatgc tttattaatg ttttcaaaag gtatnataca 24 0 

tgtgatacat tttttaagct tcagttgctt gtcttctggt actttctgtt atgggctttt 300 

ggggagccan aaaccaatct acnatctctt tttgtttgcc aggacatgca ataaaattta 360 

aaaaataaat aaaaactatt nagaaattga aaaaaa 396 



<210> 70 
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<211> 536 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 388, 446, 455 
<223> n = A,T,C or G 



<400> 70 

actagtgcaa aagcaaatat aaacatcgaa aaggcgttcc tcacgttagc tgaagatatc 60 
cttcgaaaga cccctgtaaa agagcccaac agtgaaaatg tagatatcag cagtggagga 12 0 
ggcgtgacag gctggaagag caaatgctgc tgagcattct cctgttccat cagttgccat 18 0 
ccactacccc gttttctctt cttgctgcaa aataaaccac tctgtccatt tttaactcta 240 
aacagatatt tttgtttctc atcttaacta tccaagccac ctattttatt tgttctttca 300 
tctgtgactg cttgctgact ttatcataat tttcttcaaa caaaaaaatg tatagaaaaa 360 
tcatgtctgt gacttcattt ttaaatgnta cttgctcagc tcaactgcat ttcagttgtt 420 
ttatagtcca gttcttatca acattnaaac ctatngcaat catttcaaat ctattctgca 480 
aattgtataa gaataaaagt tagaatttaa caattaaaaa aaaaaaaaaa aaaaaa 53 6 

<210> 71 

<211> 865 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 22, 35, 39, 56, 131, 138, 146, 183, 194, 197, 238, 269, 277, 
282, 297, 316, 331, 336, 340, 341, 346, 349, 370, 376, 381, 
382, 392, 396, 397, 401, 433, 444, 445, 454, 455, 469, 472, 
477, 480, 482, 489, 497, 499, 511, 522, 526, 527 
<223> n = A,T,C or G 

<221> misc_feature 

<222> 545, 553, 556, 567, 574, 580, 610, 613, 634, 638, 639, 663, 
672, 689, 693, 694, 701, 704, 713, 723, 729, 732, 743, 744, 
749, 761, 765, 767, 769, 772, 774, 780, 783, 788, 792, 803, 
810, 824, 840, 848 
<223> n = A,T,C or G 



<400> 71 

gacaaagcgt taggagaaga anagaggcag 
cccaccagca accagcgccc cccaccagcc 
ggattaatct nacctctntc gcctgnccca 
tcncaccaag aganaanctg ctgccaacac 
gaaactggtg accaatctgc agaattctna 
cagagctgga tatgangcca gaccatggac 
gaagatggan gacccncgac nngatcaggc 
attcccgctg aangaatctc tgannggctt 
tncaacatng ggattanang ctgggaactg 
acaanctctc ccnaanaaac tggggcncct' 
cacgccaagn aantataaaa ggggggcccc 
ganggttatc cnccttgcgt accatggtnc 
ccncctatnt cnagccgaac tcnnatttnc 
ttngttgncc cngccctttc cgncggaacn 
aagggtgntt ggccccctcc ctccc 



ggaanactnc 


ccaggcacga 


tggccncctt 


60 


cccaggcccg 


gacgacgaag 


actccatcct 


120 


ttcctacctc 


ggaggtggag 


gccggaaagg 


180 


caaccgcccc 


agccctggcg 


ggcacganag 


240 


gaggaanaag 


cnaggggccc 


cgcgctnaga 


300 


nctacncccn 


ncaatncana 


cgggactgcg 


360 


cngctnncca 


nccccccacc 


cctatgaatt 


420 


ccannaaagc 


gcctccccnc 


cnaacgnaan 


480 


naaggggcaa 


ancctnnaat 


atccccagaa 


540 


catnggtggn 


accaactatt 


aactaaaccg 


600 


tccncggnng 


accccctttt 


gtcccttaat 


660 


ccnnttctgt 


ntgnatgttt 


ccnctcccct 


720 


ccgggggtgc 


natcnantng 


tncncctttn 


780 


cgtttccccg 


ttantaacgg 


cacccggggn 


840 








865 



<210> 72 
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<211> 560 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_f eature 

<222> 83, 173, 183, 186, 209, 211, 215, 255, 321, 322, 323, 335, 
344, 357, 361, 368, 394, 412, 415, 442, 455, 469, 472, 475, 
487, 513, 522, 528, 531, 534, 546 
<223> n = A,T,C or G 



<400> 72 

cctggacttg tcttggttcc agaacctgac 
aaaagacagt gtccagtgct ccngcctagg 
ccatgcccaa cttctctggc aactggaaaa 
tcnaantgct gggggtgaat gtgatgctna 
cagcagtgga gatcnaacag gagggagaca 
gcaccacaaa gattaacttc nnngttgggg 
ngcctgtnaa aacctggtga aatgggagaa 
cctgaaagga gaaggccccc anaactcctg 
actgatnctt gaaccctgaa cgggcgggat 
tttccntttc cccaaaaaaa 



gacccggcga cggcgacgtc tcttttgact 60 
agtctacggg gaccgcctcc cgcgccgcca 120 
tcatccgatc ggaaaacttc gangaattgc 18 0 
ngaanattgc tgtggctgca gcgtccaagc 240 
ctttctacat caaaacctcc accaccgtgc 300 
aggantttga ggancaaact gtggatngga 360 
tganaataaa atggtctgtg ancanaaact 420 
gaccngaaaa actgacccnc cnatngggga 48 0 
ganccttttt tnttgccncc naangggttc 540 
560 



<210> 73 

<211> 379 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 8, 17, 18, 21, 26, 
111, 112, 114, 119, 122, 
214, 215, 219, 220, 235, 
319, 322, 343, 353, 354 
<223> n = A,T,C or G 



29, 30, 32, 53, 56, 
124, 125, 134, 144, 
237, 246, 280, 288, 



67, 71, 81, 102, 104, 
146, 189, 190, 
302, 310, 313, 



<400> 73 

ctggggancc ggcggtnngc nccatntcnn 
aaccgcncaa naaacatgcc naagatatgg 
gnanngagga acanaacaaa ctcnangagc 
ttggccacnn gtggaattaa gaaatctggc 
ataagngacc ctttatttca tctgtattta 
tnccacgtan agntggaant anttgttgtc 
ttgttcaaaa aaaaaataa 



gncgcgaagg tggcaataaa aanccnctga 60 
acgaggaaga tngngctttc nngnacaanc 120 
tctcaagcta atgccgcggg gaaggggccc 180 
aaanngtann tgttccttgt gcctnangag 24 0 
aacctctctn ttccctgnca taacttcttt 300 
ttggactgtt gtncatttta gannaaactt 360 
379 



<210> 74 

<211> 437 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 145, 355 
<223> n = A,T,C or G 



<400> 74 

actagttcag actgccacgc caaccccaga aaatacccca catgccagaa aagtgaagtc 60 
ctaggtgttt ccatctatgt ttcaatctgt ccatctacca ggcctcgcga taaaaacaaa 120 
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acaaaaaaac gctgccaggt tttanaagca 
caccagggtt cttttgaaat agtaccacat 
aatcactgaa ttgtcaggct ttgattgata 
gaataagtta taatcagtat tcatctcttt 
gtcatttgta ctgtttgaaa aatatttctt 
aaaaaaaaaa aaaaaaa 



gttctggtct caaaaccatc aggatcctgc 18 0 
gtaaaaggga atttggcttt cacttcatct 24 0 
attgtagaaa taagtagcct tctgttgtgg 300 
gttttttgtc actcttttct ctctnattgt 360 
ctataaaatt aaactaacct gccttaaaaa 42 0 
437 



<210> 75 

<211> 579 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 440, 513, 539, 551 
<223> n = A,T,C or G 



<400> 75 

ctccgtcgcc gccaagatga tgtgcggggc gccctccgcc acgcagccgg ccaccgccga 60 
gacccagcac atcgccgacc aggtgaggtc ccagcttgaa gagaaagaaa acaagaagtt 12 0 
ccctgtgttt aaggccgtgt cattcaagag ccaggtggtc gcggggacaa actacttcat 18 0 
caaggtgcac gtcggcgacg aggacttcgt acacctgcga gtgttccaat ctctccctca 240 
tgaaaacaag cccttgacct tatctaacta ccagaccaac aaagccaagc atgatgagct 300 
gacctatttc tgatcctgac tttggacaag gcccttcagc cagaagactg acaaagtcat 360 
cctccgtcta ccagagcgtg cacttgtgat cctaaaataa gcttcatctc cgggctgtgc 42 0 
ccttggggtg gaaggggcan gatctgcact gcttttgcat ttctcttcct aaatttcatt 48 0 
gtgttgattc tttccttcca ataggtgatc ttnattactt tcagaatatt ttccaaatna 54 0 
gatatatttt naaaatcctt aaaaaaaaaa aaaaaaaaa 57 9 

<210> 76 

<211>' 666 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 411, 470, 476, 491, 506, 527, 560, 570, 632, 636, 643, 650, 
654, 658 

<223> n = A,T,C or G 



<400> 76 

gtttatccta tctctccaac 
tccctgtttc ttccacagtg 
ttgatgttgt tatgggcagg 
ttcctggcta ctccatgttg 
ctcactacag ggaccaggga 
cagcttctcc aacaataaaa 
taaaaaatat acagtttacc 
cagccagtga acaacctttt 
ttctcaataa ncctcacttt 
tcattttagg caaatatgan 
atatcaatta ccacccccat 
cttaaa 



cagattgtca gctccttgag 
cctaataata ctgtggaact 
atggcaacca gaccattgtc 
gctagcctct ggtaacctct 
tgatgcaaca tccttgtctt 
agcacgtggt aaaacacttg 
gaaaatcata ttatcttaca 
cccaccatac aaaaattcct 
cttaanatct tacaagatag 
ttttattgtn cgttacttgt 
ctcccatgaa anaaanggga 



ggcaagagcc acagtatatt 60 
aggttttaat aattttttaa 120 
tcagagcagg tgctggctct 18 0 
tacttattat cttcaggaca 240 
tttatgacag gatgtttgct 300 
cggatattct ggactgtttt 360 
atgaaaagga ntttatagat 42 0 
tttcccgaan gaaaanggct 480 
ccccganatc ttatcgaaac 54 0 
ttcaaaattt ggtattgtga 600 
aanggtgaan ttcntaancg 660 
666 



<210> 77 
<211> 396 
<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_feature 

<222> 31, 54, 125, 128, 136, 163, 168, 198 
<223> n = A,T,C or G 

<400> 77 

ctgcagcccg ggggatccac taatctacca nggttatttg gcagctaatt ctanatttgg 60 
atcattgccc aaagttgcac ttgctggtct cttgggattt ggccttggaa aggtatcata 12 0 
catanganta tgccanaata aattccattt ttttgaaaat canctccntg gggctggttt 18 0 
tggtccacag cataacangc actgcctcct tacctgtgag gaatgcaaaa taaagcatgg 24 0 
attaagtgag aagggagact ctcagccttc agcttcctaa attctgtgtc tgtgactttc 300 
gaagtttttt aaacctctga atttgtacac atttaaaatt tcaagtgtac tttaaaataa 360 
aatacttcta atgggaacaa aaaaaaaaaa aaaaaa 39 6 

<210> 78 

<211> 793 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 309, 492, 563, 657, 660, 703, 708, 710, 711, 732, 740, 748, 
758, 762, 765, 787 
<223> n = A, T, C or G 



<400> 78 

gcatcctagc cgccgactca cacaaggcag 
gaaaattcca gtgtcagcat tcttgctcct 
taccacagtc aaacctggag ccaaaaagga 
gaccctctcc agaggttggg gtgaccaact 
atataaatcc aagacaagca acaaaccctt 
acacagtcna gctttaaaga aagtgtttgc 
gcagtttgtc ctcctcaatc tggtttatga 
ccagtatgtc ccaggattat gtttgttgac 
ggaagatatt cnaaccgtct ctatgcttac 
atgaaaaagc tctcaagttg ctnaaaatga 
tctgtcggct tgaaaattga aaccagaaaa 
gacacctgat taggttttgg ttatgttcac 
ttggttcaat tntctttttn aaacaatntg 
aataatnttt ggc 

<210> 79 

<211> 456 

<212> DNA 

<213> Homo sapiens 



gtgggtgagg aaatccagag ttgccatgga 60 
tgtggccctc tcctacactc tggccagaga 120 
cacaaaggac tctcgaccca aactgcccca 180 
catctggact cagacatatg aagaagctct 24 0 
gatgattatt catcacttgg atgagtgccc 300 
tgaaaataaa gaaatccaga aattggcaga 360 
aacaactgac aaacaccttt ctcctgatgg 42 0 
ccatctctga cagttgaagc cgatatcctg 480 
aaactgcaga tacgctctgt tgcttgacac 540 
attgtaagaa aaaaaatctc cagccttctg 600 
atgtgaaaaa tggctattgt ggaacanatn 660 
cactattttt aanaaaanan nttttaaaat 72 0 
tttctacntt gnganctgat ttctaaaaaa 780 
793 



<220> 

<221> misc_feature 

<222> 89, 195, 255, 263, 266, 286, 353, 384, 423, 425, 436, 441 
<223> n = A,T,C or G 

<400> 79 

actagtatgg ggtgggaggc cccacccttc tcccctaggc gctgttcttg ctccaaaggg 60 
ctccgtggag agggactggc agagctgang ccacctgggg ctggggatcc cactcttctt 12 0 
gcagctgttg agcgcaccta accactggtc atgcccccac ccctgctctc cgcacccgct 180 
tcctcccgac cccangacca ggctacttct cccctcctct tgcctccctc ctgcccctgc 240 
tgcctctgat cgtangaatt gangantgtc ccgccttgtg gctganaatg gacagtggca 300 
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ggggctggaa atgggtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt gcnccccccc 360 
tgcaagaccg agattgaggg aaancatgtc tgctgggtgt gaccatgttt cctctccata 42 0 
aantncccct gtgacnctca naaaaaaaaa aaaaaa 45 6 

<210> 80 

<211> 284 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 283 

<223> n = A,T,C or G 



<400> 80 

ctttgtacct ctagaaaaga taggtattgt gtcatgaaac ttgagtttaa attttatata 60 
taaaactaaa agtaatgctc actttagcaa cacatactaa aattggaacc atactgagaa 12 0 
gaatagcatg acctccgtgc aaacaggaca agcaaatttg tgatgtgttg attaaaaaga 18 0 
aataaataaa tgtgtatatg tgtaacttgt atgtttatgt ggaatacaga ttgggaaata 240 
aaatgtattt cttactgtga aaaaaaaaaa aaaaaaaaaa aana 284 

<210> 81 

<211> 671 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 388, 505, 600, 603, 615, 642, 644, 660 
<223> n = A,T,C or G 



<400> 81 

gccaccaaca ttccaagcta ccctgggtac ctttgtgcag tagaagctag tgagcatgtg 60 
agcaagcggt gtgcacacgg agactcatcg ttataattta ctatctgcca agagtagaaa 120 
gaaaggctgg ggatatttgg gttggcttgg ttttgatttt ttgcttgttt gtttgttttg 180 
tactaaaaca gtattatctt ttgaatatcg tagggacata agtatataca tgttatccaa 240 
tcaagatggc tagaatggtg cctttctgag tgtctaaaac ttgacacccc tggtaaatct 300 
ttcaacacac ttccactgcc tgcgtaatga agttttgatt catttttaac cactggaatt 360 
tttcaatgcc gtcattttca gttagatnat tttgcacttt gagattaaaa tgccatgtct 42 0 
atttgattag tcttattttt ttatttttac aggcttatca gtctcactgt tggctgtcat 480 
tgtgacaaag tcaaataaac ccccnaggac aacacacagt atgggatcac atattgtttg 54 0 
acattaagct ttggccaaaa aatgttgcat gtgttttacc tcgacttgct aaatcaatan 600 
canaaaggct ggctnataat gttggtggtg aaataattaa tnantaacca aaaaaaaaan 660 
aaaaaaaaaa a 671 



<210> 82 

<211> 217 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 35 

<223> n = A,T,C or G 



<400> 82 

ctgcagatgt ttcttgaatg ctttgtcaaa ttaanaaagt taaagtgcaa taatgtttga 60 
agacaataag tggtggtgta tcttgtttct aataagataa acttttttgt ctttgcttta 12 0 
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tcttattagg gagttgtatg tcagtgtata aaacatactg tgtggtataa caggcttaat 180 
aaattcttta aaaggaaaaa aaaaaaaaaa aaaaaaa 217 

<210> 83 

<211> 460 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 104, 118, 172, 401, 422, 423, 444, 449 
<223> n = A,T,C or G 

<400> 83 

cgcgagtggg agcaccagga tctcgggctc ggaacgagac tgcacggatt gttttaagaa 60 
aatggcagac aaaccagaca tgggggaaat cgccagcttc gatnaggcca agctgaanaa 12 0 
aacggagacg caggagaaga acaccctgcc gaccaaagag accattgagc angagaagcg 180 
gagtgaaatt tcctaagatc ctggaggatt tcctaccccc gtcctcttcg agaccccagt 240 
cgtgatgtgg aggaagagcc acctgcaaga tggacacgag ccacaagctg cactgtgaac 300 
ctgggcactc cgcgccgatg ccaccggcct gtgggtctct gaagggaccc cccccaatcg 360 
gactgccaaa ttctccggtt tgccccggga tattatacaa nattatttgt atgaataatg 420 
annataaaac acacctcgtg gcancaaana aaaaaaaaaa 4 60 

<210> 84 

<211> 323 

<212> DNA 

<213> Homo sapiens 

<220>' 

<221> misc_feature 

<222> 70, 138, 178, 197, 228, 242, 244, 287, 311 
<223> n = A,T,C or G 

<400> 84 

tggtggatct tggctctgtg gagctgctgg gacgggatct aaaagactat tctggaagct 60 
gtggtccaan gcattttgct ggcttaacgg gtcccggaac aaaggacacc agctctctaa 12 0 
aattgaagtt tacccganat aacaatcttt tgggcagaga tgcctatttt aacaaacncc 180 
gtccctgcgc aacaacnaac aatctctggg aaataccggc catgaacntg ctgtctcaat 240 
cnancatctc tctagctgac cgatcatatc gtcccagatt actacanatc ataataattg 300 
atttcctgta naaaaaaaaa aaa 323 

<210> 85 

<211> 771 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 63, 426, 471, 497, 521, 554, 583, 586, 606, 609, 615, 652, 
686, 691, 694, 695, 706, 713, 730, 732, 743, 751 
<223> n = A,T,C or G 

<400> 85 

aaactgggta ctcaacactg agcagatctg ttctttgagc taaaaaccat gtgctgtacc 60 
aanagtttgc tcctggctgc tttgatgtca gtgctgctac tccacctctg cggcgaatca 120 
gaagcaagca actttgactg ctgtcttgga tacacagacc gtattcttca tcctaaattt 18 0 
attgtgggct tcacacggca gctggccaat gaaggctgtg acatcaatgc tatcatcttt 240 
cacacaaaga aaaagttgtc tgtgtgcgca aatccaaaac agacttgggt gaaatatatt 300 
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gtgcgtctcc tcagtaaaaa agtcaagaac atgtaaaaac tgtggctttt ctggaatgga 360 
attggacata gcccaagaac agaaagaact tgctggggtt ggaggtttca cttgcacatc 42 0 
atgganggtt tagtgcttat cttatttgtg cctcctggac ttgtccaatt natgaagtta 48 0 
atcatattgc atcatanttt gctttgttta acatcacatt naaattaaac tgtattttat 54 0 
gttatttata gctntaggtt ttctgtgttt aactttttat acnaantttc ctaaactatt 600 
ttggtntant gcaanttaaa aattatattt ggggggggaa taaatattgg antttctgca 660 
gccacaagct ttttttaaaa aaccantaca nccnngttaa atggtnggtc ccnaatggtt 72 0 
tttgcttttn antagaaaat ttnttagaac natttgaaaa aaaaaaaaaa a 771 



<210> 86 

<211> 628 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<222> 162, 249, 266, 348, 407, 427, 488, 518, 545, 566, 569, 597, 
598, 611, 617, 621, 624 
<223> n = A,T,C or G 

<400> 86 

actagtttgc tttacatttt tgaaaagtat tatttttgtc caagtgctta tcaactaaac 60 
cttgtgttag gtaagaatgg aatttattaa gtgaatcagt gtgacccttc ttgtcataag 12 0 
attatcttaa agctgaagcc aaaatatgct tcaaaagaaa angactttat tgttcattgt 180 
agttcataca ttcaaagcat ctgaactgta gtttctatag caagccaatt acatccataa 240 
gtggagaang aaatagatta atgtcnaagt atgattggtg gagggagcaa ggttgaagat 300 
aatctggggt tgaaattttc tagttttcat tctgtacatt tttagttnga catcagattt 360 
gaaatattaa tgtttacctt tcaatgtgtg gtatcagctg gactcantaa cacccctttc 42 0 
ttccctnggg gatggggaat ggattattgg aaaatggaaa gaaaaaagta cttaaagcct 480 
tcctttcnca gtttctggct cctaccctac tgatttancc agaataagaa aacattttat 54 0 
catcntctgc tttattccca ttaatnaant tttgatgaat aaatctgctt ttatgcnnac 600 
ccaaggaatt nagtggnttc ntcnttgt 62 8 



<210> 87 

<211> 518 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 384, 421, 486 
<223> n = A,T,C or G 



<400> 87 

ttttttattt tttttagaga gtagttcagc 
tataacaaca ttatactgtt tatggtttaa 
agtagtacag ttttaaaatt ttatgcttaa 
ttttacatgg caaatcaatt tttaagtcat 
aaacacattt aatttcaatt tctctcttat 
ctacagttta acaatgcagc aaaattccca 
ggttaaaatg ctttgaggat cctnaatacc 
naatttaacc ctcatgccat aagcagaagc 
taaaancgag ccccccgttg aaaaagcaaa 



ttttatttat aaatttattg cctgttttat 60 
tacatatggt tcaaaatgta taatacatca 120 
aacaagtttt gtgtaaaaaa tgcagataca 18 0 
cctaaaaatt gatttttttt tgaaatttaa 240 
ataaccttta ttactatagc atggtttcca 300 
tttcacggta aattgggttt taagcggcaa 360 
ctttgaactt caaatgaagg ttatggttgt 42 0 
acaagtttag ctgcattttg ctctaaactg 48 0 
agggaccc 518 



<210> 88 

<211> 1844 

<212> DNA 

<213> Homo sapiens 
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<400> ' 88 

gagacagtga atcctagtat caaaggattt 
tattttattt tatttttcga gactccgtct 
ggtatttgct aaagcatttt gagctgcttg 
ttccatcttc ttggtgctgg gaagccatat 
agcttatgtg ttgaatttgc tacatctata 
gaaatagaaa tatcatagaa catttaagaa 
taatcccttt gaagggatct atccaaagaa 
tctcagtaac agatcctgtg ttagtctttg 
tagatgtagc atacatatga tgtataatga 
ttgtaggaat acaaaacatg gcctttttta 
acatagggca atctgtgaat atgtattata 
taattttcaa gtcaaaaagg gatatggaaa 
ccttgctttt aaattaaacg ctacagccat 
taataatgtt aggttagcaa aggtttagat 
aatgcagctc ttcgagtcat ttctggtcat 
gcaccctacc tcacctgctt actgacattg 
tccattattc cttactgtat ataaaataca 
accatattca aaacctaaat ttgtttttgc 
gctttcacct agaagggtgt ggtcctgaag 
ggtgctcctc cttccctggt accctgacta 
agtgcagcag cctgtgcttc cacagatggg 
ccatcttagg gggagaagct agatcctgtg 
attgctcttc ctgctgctgt cctttgcttc 
catgcagcta acttgtgcct ctgcttatgc 
atttgaagtt caaaggtgta ttcaggatcc 
ccaatttacc gtgaaatggg aattttgctg 
tagtaataaa ggttatataa gagagaaatt 
aaaatcaatc tttaggatga cttaaaaatt 
ttacacaaaa cttgttttaa gcataaaatt 
ttttgaacca tatgtattaa accataaaca 
aaatttataa ataaaagctg aaaaaaaaaa 

<210> 89 

<211> 523 

<212> DNA 

<213> Homo sapiens 



ttggcctcag aaaaagttgt tgattatttt 60 
caaaaaaaaa aaaaaaaaaa agaatcacaa 120 
gaaaaaggga agtagttgca gtagagtttc 18 0 
atgtgtcttt tactcaagct aaggggtata 24 0 
tttcacatat tctcacaata agagaatttt 300 
agtttagtat aaataatatt ttgtgtgttt 360 
aatattttac actgagctcc ttcctacacg 42 0 
aaaatagctc attttttaaa tgtcagtgag 480 
cgtgtattat gttaacaatg tctgcagatt 54 0 
taagcaaaac gggccaatga ctagaataac 600 
agcagcattc cagaaaagta gttggtgaaa 660 
gggaattatg agtaacctct attttttaag 720 
ttaagccttg aggataataa agcttgagag 780 
gtatcacttc atgcatgcta ccatgatagt 84 0 
tcaagatatt cacccttttg cccatagaaa 900 
tcttagctga tcacaagatc attatcagcc 960 
gagttttata ttttcctttc ttcgtttttc 1020 
agatggaatg caaagtaatc aagtgttcgt 1080 
gaaagaggtc cctaaatatc ccccaccctg 1140 
ccagaagtca ggtgctagag cagctggaga 1200 
ggtgctgctg caacaaggct ttcaatgtgc 12 60 
cagcagcctg gtaagtcctg aggaggttcc 1320 
tcaacggggc tcgctctaca gtctagagca 1380 
atgagggtta aattaacaac cataaccttc 1440 
tcaaagcatt ttaaccttgc cgcttaaaac 1500 
cattgttaaa ctgtagtgga aaccatgcta 1560 
gaaattaaat gtgtttttaa atttcaaaaa 1620 
gatttgccat gtaaaatgta tctgcatttt 1680 
ttaaaactgt actacttgat gtattataca 17 40 
gtataatgtt gttataataa aacaggcaat 18 00 
aaaaaaaaaa aaaa 1844 



<220> 

<221> misc_feature 

<222> 288, 352, 369, 398, 475, 511, 513 
<223> n = A,T,C or G 



<400> 89 

tttttttttt tttttttagt caatccacat 
gggataaaga tgactgttag tcactcacag 
acaatatgat gtagaaaatg ctaagccaga 
tcaccttgtc tttccacatc cctacccttc 
ctccccactg cagatcccct gggattttgc 
gccctggcat gacttgaacc caaccacaga 
actttgatna gaaaacacat agggaattga 
ggtgctcaag aaaagtttgc agaatggata 
taattgaatg gtggctcaat aagaatgact 

<210> 90 
<211> 604 
<212> DNA 



ttattgatca cttattatgt accaggcact 60 
taaggaagaa aactagcaaa taagacgatt 12 0 
gatatagaaa ggtcctattg ggtccttctg 180 
acaggccttc cctccagctt cctgcccccg 240 
ctagagctaa acgagganat gggccccctg 300 
ctgggaaagg gagcctttcg anagtggatc 360 
agagaaantc cccaaatggc cacccgtgct 42 0 
aatgaaggat caagggaatt aatanatgaa 4 80 
ncnttgaatg acc 523 
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<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 563 

<223> n = A,T,C or G 



<400> 90 

ccagtgtggt ggaatgcaaa gattaccccg gaagctttcg agaagctggg attccctgca 60 
gcaaaggaaa tagocaatat gtgtcgtttc tatgaaatga agccagaccg agatgtcaat 120 
ctcacccacc aactaaatcc caaagtcaaa agcttcagcc agtttatctc agagaaccag 18 0 
gggagccttc aagggcatgt agaaaatcag ctgttcagat aggcctctgc accacacagc 24 0 
ctctttcctc tctgatcctt ttcctcttta cggcacaaca ttcatgtttg acagaacatg 300 
ctggaatgca attgtttgca acaccgaagg a-tttcctgcg gtcgcctctt cagtaggaag 360 
cactgcattg gtgataggac acggtaattt gattcacatt taacttgcta gttagtgata 42 0 
aggggtggta cacctgtttg gtaaaatgag aagcctcgga aacttgggag cttctctcct 48 0 
accactaatg gggagggcag attattactg ggatttctcc tggggtgaat taatttcaag 54 0 
ccctaattgc tgaaattccc ctnggcaggc tccagttttc tcaactgcat tgcaaaattc 600 
cccc 604 



<210> 91 

<211> 858 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 570, 591, 655, 664, 667, 683, 711, 759, 760, 765, 777, 787, 
792, 794, 801, 804, 809, 817, 820 
<223> n = A,T,C or G 

<400> 91 

tttttttttt ttttttttta tgattattat tttttttatt gatctttaca tcctcagtgt 60 
tggcagagtt tctgatgctt aataaacatt tgttctgatc agataagtgg aaaaaattgt 120 
catttcctta ttcaagccat gcttttctgt gatattctga tcctagttga acatacagaa 180 
ataaatgtct aaaacagcac ctcgattctc gtctataaca ggactaagtt cactgtgatc 24 0 
ttaaataagc ttggctaaaa tgggacatga gtggaggtag tcacacttca gcgaagaaag 300 
agaatctcct gtataatctc accaggagat tcaacgaatt ccaccacact ggactagtgg 360 
atcccccggg ctgcaggaat tcgatatcaa gcttatcgat accgtcgacc tcgagggggg 420 
gcccggtacc caattcgccc tatagtgagt cgtattacgc gcgctcactg gccgtcgttt 480 
tacaacgtcg tgactgggaa aaccctggcg ttacccaact taatcgcctt gcagcacatc 540 
cccctttcgc cagctggcgt aatagcgaan agcccgcacc gatcgccctt ncaacagttg 600 
cgcagcctga atggcgaatg ggacgcgccc tgtagcggcg cattaaagcg cggcngggtg 660 
tggnggntcc cccacgtgac cgntacactt ggcagcgcct tacgccggtc nttcgctttc 720 
ttcccttcct ttctcgcacc gttcgccggg tttccccgnn agctnttaat cgggggnctc 780 
cctttanggg tncnaattaa nggnttacng gaccttngan cccaaaaact ttgattaggg 8 40 
ggaaggtccc cgaagggg 858 

<210> 92 

<211> 585 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 317, 319, 320, 321, 325, 327, 328, 330, 331, 332, 460, 462, 
483, 485, 487, 523, 538, 566, 584 
<223> n = A,T,C or G 
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<400> 92 

gttgaatctc ctggtgagat tatacaggag attctctttc ttcgctgaag tgtgactacc 60 
tccactcatg tcccatttta gccaagctta tttaagatca cagtgaactt agtcctgtta 12 0 
tagacgagaa tcgaggtgct gttttagaca tttatttctg tatgttcaac taggatcaga 180 
atatcacaga aaagcatggc ttgaataagg aaatgacaat tttttccact tatctgatca 240 
gaacaaatgt ttattaagca tcagaaactc tgccaacact gaggatgtaa agatcaataa 300 
aaaaaataat aatcatnann naaanannan nngaagggcg gccgccaccg cggtggagct 360 
ccagcttttg ttccctttag tgagggttaa ttgcgcgctt ggcgttaatc atggtcatag 420 
ctgtttcctg tgtgaaattg ttatccggct cacaattccn cncaacatac gagccgggaa 480 
gcntnangtg taaaagcctg ggggtgccta attgagtgag ctnactcaca ttaattgngt 540 
tgcgctccac ttgcccgctt ttccantccg ggaaacctgt tcgnc 585 

<210> 93 

<211> 567 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 82, 158, 230, 232, 253, 266, 267, 268, 269, 270, 271, 272, 
273, 274, 275, 276, 277, 278, 279, 280, 281, 282, 283, 284, 
285, 286, 287, 295, 303, 307, 314, 349, 352, 354, 356, 366, 
369, 379, 382, 386, 393, 404, 427, 428, 446, 450, 452 
<223> n = A,T,C or G 



<221> misc_feature 

<222> 453, 454, 459, 462, 480, 481, 483, 488, 493, 501, 509, 511, 
512, 518, 520, 525, 526, 532, 541, 557 
<223> n = A,T,C or G 



<400> 93 

cggcagtgtt gctgtctgcg tgtccacctt ggaatctggc tgaactggct gggaggacca 60 
agactgcggc tggggtgggc anggaaggga accgggggct gctgtgaagg atcttggaac 120 
ttccctgtac ccaccttccc cttgcttcat gtttgtanag gaaccttgtg ccggccaagc 180 
ccagtttcct tgtgtgatac actaatgtat ttgctttttt tgggaaatan anaaaaatca 240 
attaaattgc tantgtttct ttgaannnnn nnnnnnnnnn nnnnnnnggg ggggncgccc 300 
ccncggngga aacnccccct tttgttccct ttaattgaaa ggttaattng cncncntggc 360 
gttaanccnt gggccaaanc tngttncccg tgntgaaatt gttnatcccc tcccaaattc 42 0 
ccccccnncc ttccaaaccc ggaaancctn annntgttna ancccggggg gttgcctaan 480 
ngnaattnaa ccnaaccccc ntttaaatng nntttgcncn ccacnngccc cnctttccca 540 
nttcggggaa aaccctntcc gtgccca 5 67 

<210> 94 

<211> 620 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 169, 171, 222, 472, 528, 559, 599 
<223> n = A,T,C or G 



<400> 94 

actagtcaaa aatgctaaaa taatttggga 
catgtttatc ttttattatg ttttgtgaag 
gccaatattt ccttatatct atccataaca 
gtgaaactta acactttata aggtaaaaat 



gaaaatattt tttaagtagt gttatagttt 60 
ttgtgtcttt tcactaatta cctatactat 12 0 
tttatactac atttgtaana naatatgcac 18 0 
gaggtttcca anatttaata atctgatcaa 24 0 
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gttcttgtta tttccaaata gaatggactt ggtctgttaa gggctaagga gaagaggaag 300 
ataaggttaa aagttgttaa tgaccaaaca ttctaaaaga aatgcaaaaa aaaagtttat 360 
tttcaagcct tcgaactatt taaggaaagc aaaatcattt cctaaatgca tatcatttgt 42 0 
gagaatttct cattaatatc ctgaatcatt catttcacta aggctcatgt tnactccgat 480 
atgtctctaa gaaagtacta tttcatggtc caaacctggt tgccatantt gggtaaaggc 54 0 
tttcccttaa gtgtgaaant atttaaaatg aaattttcct ctttttaaaa attctttana 600 
agggttaagg gtgttgggga 620 

<210> 95 
<211> 470 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 61, 67, 79, 89, 106, 213, 271, 281, 330, 354, 387, 432, 448 
<223> n = A,T,C or G 



<400> 95 

ctcgaccttc tctgcacagc ggatgaaccc tgagcagctg aagaccagaa aagccactat 60 
nactttntgc ttaattcang agcttacang attcttcaaa gagtgngtcc agcatccttt 12 0 
gaaacatgag ttcttaccag cagaagcaga cctttacccc accacctcag cttcaacagc 18 0 
agcaggtgaa acaacccatc cagcctccac ctnaggaaat atttgttccc acaaccaagg 24 0 
agccatgcca ctcaaaggtt ccacaacctg naaacacaaa nattccagag ccaggctgta 300 
ccaaggtccc tgagccaggg ctgtaccaan gtccctgagc caggttgtac caangtccct 360 
gagccaggat gtaccaaggt ccctgancca ggttgtccaa ggtccctgag ccaggctaca 420 
ccaagggcct gngccaggca gcatcaangt ccctgaccaa ggcttatcaa 47 0 

<210> 96 

<211> 660 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 299, 311, 360, 426, 538, 540, 542, 553, 563, 565, 592, 603, 
604, 618, 633, 647, 649, 651, 653 
<223> n = A,T,C or G 



<400> 96 

tttttttttt tttttttttt ggaattaaaa 
gcatttcttt tcattcgaat cttcagatga 
tgaagacttt ctgcttaatt caggggctta 
gctttatagt acgtattttt aggatacaaa 
tgtactgatt acaaggtcta cagacaatta 
cagcatctgg nggttggctt ctcaagggct 
cttctgctga gctgggcctg gagtgaccgt 
gcctgncaca ggaactttgg tgtatccttg 
aaacttgatg aagccttggt caagggacct 
ancctgggct canggacctt tgncncaacc 
gcnnagggac ccttgggncc aaccctgggc 



gcaatttaat gagggcagag caggaaacat &0 

accctgagca gccgaagacc agaaaagcca 120 

caggattctt cagagtgtgt gtgaacaaaa 18 0 

taagagagag actatggctt ggggtgagaa 24 0 

agacacagaa acagatggga agagggtgnc 30 0 

tgtctgtgca ccaaattact tctgcttggn 360 

tgaaggacat ggctctggta cctttgtgta 42 0 

ctcaggaact ttgatggcac ctggctcagg 480 

tgatgcttgc tggctcaggg accttggngn 54 0 

ttggcttcaa gggacccttg gnacatcctg 600 

ttnagggacc ctttggntnc nanccttggc 660 



<210> 97 

<211> 441 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_feature 

<222> 12, 308 

<223> n = A,T,C or G 



<400> 97 

gggaccatac anagtattcc tctcttcaca 
cccagcagca gaagcagccc tgcatcccac 
agccttgcca gcctccacct caggaaccat 
ccaaggtgcc tgagccctgc caccccaaag 
agccatgcca ccccaaggtg cctgagccct 
agcagaanac caagcagaag taatgtggtc 
agatgctgaa tcccctatcc cattctgtgt 
ctgtctcccc caaaaaaaaa a 

<210> 98 

<211> 600 

<212> DNA 

<213> Homo sapiens 



ccaggaccag ccactgttgc agcatgagtt 60 
cccctcagct tcagcagcag caggtgaaac 12 0 
gcatccccaa aaccaaggag ccctgccacc 18 0 
tgcctgagcc ctgccagccc aaggttccag 240 
gcccttcaat agtcactcca gcaccagccc 300 
cacagccatg cccttgagga gccggccacc 360 
atgagtccca tttgccttgc aattagcatt 42 0 
441 



<220> 

<221> misc_feature 

<222> 295, 349, 489, 496, 583 

<223> n = A,T,C or G 



<400> 98 

gtattcctct cttcacacca ggaccagcca ctgttgcagc atgagttccc agcagcagaa 60 
gcagccctgc atcccacccc ctcagcttca gcagcagcag gtgaaacagc cttgccagcc 120 
tccacctcag gaaccatgca tccccaaaac caaggagccc tgccacccca aggtgcctga 180 
gccctgccac cccaaagtgc ctgagccctg ccagcccaag gttccagagc catgccaccc 240 
caaggtgcct gagccctgcc cttcaatagt cactccagca ccagcccagc agaanaccaa 300 
gcagaagtaa tgtggtccac agccatgccc ttgaggagcc ggccaccana 'tgctgaatcc 360 
cctatcccat tctgtgtatg agtcccattt gccttgcaat tagcattctg tctcccccaa 42 0 
aaaagaatgt gctatgaagc tttctttcct acacactctg agtctctgaa tgaagctgaa 480 
ggtcttaant acaganctag ttttcagctg ctcagaattc tctgaagaaa agatttaaga 54 0 
tgaaaggcaa atgattcagc tccttattac cccattaaat tcnctttcaa ttccaaaaaa 600 



<210> 99 
<211> 667 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 345, 562, 635 
<223> n = A,T,C or G 



<400> 99 

actagtgact gagttcctgg caaagaaatt 
accatttaaa aaaatcagtg aaggatttga 
ggtcctgacg ttttgagatc caaagtggca 
tttctcttgt gagagttccc tcatctgaaa 
agtagaagat ttgttgaaga catagaaccc 
ttaaagtctt gtgagcacct gggaattagt 
attttgtaag gctataattg tatcttttaa 
tggagatttt taagagtttt aaccagctgc 
gtataaagat atagtaaatg catctcctag 



tgacctggac cagttgataa ctcatgtttt 60 
gctgctcaat tcaggacaaa gcattcgaac 12 0 
ggaggtctgt gttgtcatgg tgaactggag 18 0 
tcatgtatct gtctcacaaa tacaagcata 24 0 
ttataaagaa ttattaacct ttataaacat 300 
ataataacaa tgttnatatt tttgatttac 360 
gaaaacatac cttggatttc tatgttgaaa 42 0 
tgcagatata ttactcaaaa cagatatagc 480 
agtaatattc acttaacaca ttggaaacta 54 0 
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ttatttttta gatttgaata tnaatgttat tttttaaaca cttgttatga gttacttggg 600 
attacatttt gaaatcagtt cattccatga tgcanattac tgggattaga ttaagaaaga 660 
cggaaaa 667 



<210> 100 
<211> 583 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 404, 506, 514, 527, 528, 538, 548, 556, 568, 569 
<223> n = A,T,C or G 



<400> 100 

gttttgtttg taagatgatc acagtcatgt tacactgatc taaaggacat atatataacc 60 

ctttaaaaaa aaaatcactg cctcattctt atttcaagat gaatttctat acagactaga 120 

tgtttttctg aagatcaatt agacattttg aaaatgattt aaagtgtttt ccttaatgtt 180 

ctctgaaaac aagtttcttt tgtagtttta accaaaaaag tgcccttttt gtcactggat 240 

tctcctagca ttcatgattt ttttttcata caatgaaatt aaaattgcta aaatcatgga 300 

ctggctttct ggttggattt caggtaagat gtgtttaagg ccagagcttt tctcagtatt 360 

tgattttttt ccccaatatt tgatttttta aaaatataca catnggtgct gcatttatat 42 0 

ctgctggttt aaaattctgt catatttcac ttctagcctt ttagttatgg caaatcatat 480 

tttactttta cttaaagcat ttggtnattt ggantatctg gttctannct aaaaaaanta 540 
attctatnaa ttgaantttt ggtactcnnc catatttgga tec 583 

<210> 101 
<211> 592 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 218, 497, 502, 533, 544, 546, 548, 550, 555 
<223> n = A,T,C or G 



<400> 101 

gtggagacgt acaaagagca gccgctcaag 
gggaaacgea aggagcagga aaagaaaaaa 
ggagtgactg ggagtgggct agaaggggac 
gagctcgatt caeggaggea ttgaaatttt 
gattctgtaa tagtgaacat atggaaagta 
aaatgcattg gaataaaact gtctccccca 
tgaatatttt tttttttgee aaggctaatc 
attttgtcca ttgatgtatt tattttgtaa 
tttttgtaca taatgcnttt anatatacct 
gtgnencnan ttggnggttg aatttaatga 

<210> 102 
<211> 587 
<212> DNA 
<213> Homo sapiens 



acacctggga agaaaaagaa aggcaagccc 60 
cggcgaactc gctctgcctg gttagactct 120 
cacctgtctg acacctccac aacgtcgctg 180 
cagcaganac cttccaagga catattgeag 240 
ttagaaatat ttattgtctg taaatactgt 300 
ttgctctatg aaactgeaca ttggtcattg 360 
caattattat tatcacattt accataattt 420 
atgtatcttg gtgctgctga atttctatat 480 
atcaagtttg ttgataaatg aencaatgaa 540 
atgectaatt ttattatccc aa 592 



<220> 

<221> misc_feature 

<222> 91, 131, 256, 263, 332, 392, 400, 403, 461, 496, 497, 499, 
510, 511, 518, 519, 539, 554, 560, 576 
<223> n = A,T,C or G 
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<400> 102 

cgtcctaagc acttagacta catcagggaa gaacacagac cacatccctg tcctcatgcg 60 
gcttatgt'tt tctggaagaa agtggagacc nagtccttgg ctttagggct ccccggctgg 12 0 
gggctgtgca ntccggtcag ggcgggaagg gaaatgcacc gctgcatgtg aacttacagc 18 0 
ccaggcggat gccccttccc ttagcactac ctggcctcct gcatcccctc gcctcatgtt 240 
cctcccacct tcaaanaatg aanaacccca tgggcccagc cccttgccct ggggaaccaa 300 
ggcagccttc caaaactcag gggctgaagc anactattag ggcaggggct gactttgggt 360 
gacactgccc attccctctc agggcagctc angtcacccn ggnctcttga acccagcctg 42 0 
ttcctttgaa aaagggcaaa actgaaaagg gcttttccta naaaaagaaa aaccagggaa 480 
ctttgccagg gcttcnntnt taccaaaacn ncttctcnng gatttttaat tccccattng 540 
gcctccactt accnggggcn atgccccaaa attaanaatt tcccatc 587 

<210> 103 
<211> 496 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 2, 17, 66, 74, 82, 119, 164, 166, 172, 200, 203, 228, 232, 
271, 273, 415, 423, 445, 446, 473 
<223> n = A,T,C or G 



<400> 103 

anaggactgg ccctacntgc tctctctcgt 
ctgcanccct tggncactgc anatggaaac 
gcggtgggtc tccaccacaa ccactttgac 
actggcagga tggaccttan ccnacatatc 
cccttaacat gatataatcc acccatgcaa 
ttgcctacag aatttcattc agtctacact 
tgggctgacc gcaaaaggtg ccttacacac 
gangcttgcc tcctccttct gattnncccc 
ggaaaagaaa caaaac 



cctacctatc aatgcccaac atggcagaac 60 
ctctcagtgt cttgacatca ccctacccnt 120 
tctgtggtcc ctgnanggtg gnttctcctg 180 
cctctgttcc ctctgctnag anaaagaatt 240 
ntngctactg gcccagctac catttaccat 300 
ttggcattct ctctggcgat agagtgtggc 360 
tggcccccac cctcaaccgt tgacncatca 420 
catgttggat atcagggtgc tcnagggatt 480 
496 



<210> 104 
<211> 575 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 18, 19, 45, 68, 77, 132, 155, 174, 219, 226, 238, 259, 263, 
271, 273, 306, 323, 339, 363, 368, 370, 378, 381, 382, 436, 
440, 449, 450, 456, 481, 485, 496, 503, 510, 512, 515, 528, 
542, 552 

<223> n = A,T,C or G 



<400> 104 

gcacctgctc tcaatccnnc tctcaccatg 
ctatggangt ggtttcnggg gtggctcttg 
ctgttcaact cngtttgtgt ctgggggatc 
tgttttggtg gaagggctgg taattggctt 
gaagttgcta ttgaaagtng ccntggaagt 
ttgttnaatt tgggtgcttt gtnaatggcg 
ccnatgcngn aaacctcnac nnaacagcct 
cccccccaaa aaaggncaan cccctcaann 
ncccnaaaac aaaaancccc ccntttcccn 



atcctccgcc tgcanaaact cctctgccaa 60 
ccaactggga agaagccgtg gtgtctctac 12 0 
aactnggggc tatggaagcg gctnaactgt 180 
tgggaagtng cttatngaag ttggcctngg 24 0 
ngntttggtg gggggttttg ctggtggcct 300 
gccccctcnc ctgggcaatg aaaaaaatca 360 
gggcttccct cacctcgaaa aaagttgctc 42 0 
tggaangttg aaaaaatcct cgaatgggga 480 
gnaanggggg aaataccncc cccccactta 54 0 
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cnaaaaccct tntaaaaaac cccccgggaa aaaaa 575 

<210> 105 
<211> 619 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 260, 527, 560, 564, 566, 585, 599 
<223> n = A,T,C or G 

<400> 105 

cactagtagg atagaaacac tgtgtcccga gagtaaggag agaagctact attgattaga 60 

gcctaaccca ggttaactgc aagaagaggc gggatacttt cagctttcca tgtaactgta 120 

tgcataaagc caatgtagtc cagtttctaa gatcatgttc caagctaact gaatcccact 180 

tcaatacaca ctcatgaact cctgatggaa caataacagg cccaagcctg tggtatgatg 24 0 

tgcacacttg ctagactcan aaaaaatact actctcataa atgggtggga gtattttggt 300 

gacaacctac tttgcttggc tgagtgaagg aatgatattc atatattcat ttattccatg 360 

gacatttagt tagtgctttt tatataccag gcatgatgct gagtgacact cttgtgtata 42 0 

tttccaaatt tttgtacagt cgctgcacat atttgaaatc atatattaag acttccaaaa 480 

aatgaagtcc ctggtttttc atggcaactt gatcagtaaa ggattcncct ctgtttggta 540 

cttaaaacat ctactatatn gttnanatga aattcctttt ccccncctcc cgaaaaaana 600 

aagtggtggg gaaaaaaaa 619 

<210> 106 
<211> 506 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 8, 21, 31, 32, 58, 75, 89, 96, 99, 103, 122, 126, 147, 150, 
158, 195, 210, 212, 219, 226, 246, 248, 249, 255, 258, 261, 
263, 265, 275, 304, 317, 321, 331, 337, 340, 358, 371, 377, 
380, 396, 450, 491 
<223> n = A,T,C or G 

<400> 106 

cattggtnct ttcatttgct ntggaagtgt nnatctctaa cagtggacaa agttcccngt 60 
gccttaaact ctgtnacact tttgggaant gaaaanttng tantatgata ggttattctg 120 
angtanagat gttctggata ccattanatn tgcccccngt gtcagaggct catattgtgt 180 
tatgtaaatg gtatntcatt cgctactatn antcaattng aaatanggtc tttgggttat 240 
gaatantnng cagcncanct nanangctgt ctgtngtatt cattgtggtc atagcacctc 300 
acancattgt aacctcnatc nagtgagaca nactagnaan ttcctagtga tggctcanga 360 
ttccaaatgg nctcatntcn aatgtttaaa agttanttaa gtgtaagaaa tacagactgg 42 0 
atgttccacc aactagtacc tgtaatgacn ggcctgtccc aacacatctc ccttttccat 480 
gactgtggta ncccgcatcg gaaaaa 506 

<210> 107 
<211> 452 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 289, 317, 378 
<223> n = A,T,C or G 
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<400> 107 

gttgagtctg tactaaacag taagatatct caatgaacca taaattcaac tttgtaaaaa 60 
tcttttgaag catagataat attgtttggt aaatgtttct tttgtttggt aaatgtttct 12 0 
tttaaagacc ctcctattct ataaaactct gcatgtagag gcttgtttac ctttctctct 18 0 
ctaaggttta caataggagt ggtgatttga aaaatataaa attatgagat tggttttcct 240 
gtggcataaa ttgcatcact gtatcatttt cttttttaac cggtaagant ttcagtttgt 300 
tggaaagtaa ctgtganaac ccagtttccc gtccatctcc cttagggact acccatagaa 3 60 
catgaaaagg tccccacnga agcaagaaga taagtctttc atggctgctg gttgcttaaa 42 0 
ccactttaaa accaaaaaat tccccttgga aa 452 

<210> 108 
<211> 502 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 22, 31, 126, 168, 183, 205, 219, 231, 236, 259, 283, 295, 
296, 298, 301, 340, 354, 378, 383, 409, 433, 446, 455, 466, 
488 

<223> n = A,T,C or G 



<400> 108 

atcttcttcc cttaattagt tnttatttat 
caaaaagaga ttgtagattg gcttctggct 
agaccncaac tgaagcttaa aaaatctatc 
tanagcatat aaaactttta acatntgctt 
aaaatgtccc tttaacatnc aatatcccac 
naaaaaaagg gtagaaggga tttaatgaaa 
ctccagaaca aaaacttntc aantctttca 
aaactccatt agncccactt tctaanggtc 
accctggnta ctcctgccct ca 

<210> 109 
<211> 1308 
<212> DNA 
<213> Homo sapiens 



ntattaaatt ttattgcatg tcctggcaaa 60 
ccccaaaagc ccataacaga aagtaccaca 12 0 
acatgtataa tacctttnga agaacattaa 180 
aatgttgtnc aattataaaa ntaatngaaa 240 
atagtgttat ttnaggggat taccnngnaa 300 
actctgcttn ccatttctgt ttanaaacgt 360 
gctaaccgca tttgagctna ggccactcaa 420 
tctanagctt actaancctt ttgacccctt 480 
502 



<400> 109 

acccgaggtc tcgctaaaat catcatggat 
tttgatcttt tcaaagagct gaagaaaaca 
ggcatcttga ctgcaattgg catggtcctc 
ttggaggagg tgtttcactc tgaaaaagag 
aaagaggtga ttgagaacac agaagcagta 
ataagcaaac tcactaatga ttatgaactg 
acatacctct tccttcaaaa atacttagat 
gaacctgttg attttgtaaa tgcagccgat 
gaaagcaaaa caaatgaaaa aatcaaggac 
accaagctgg tgctggtgaa catggtttat 
aaagaaaata ctaaggaaga gaaattttgg 
atgatgacac agagccattc ctttagcttc 
ctagggattc catataaaaa caacgaccta 
gatggcctgg agaagataat agataaaata 
ccagggcata tggaagaaag aaaggtgaat 
agttacgatc tagaggcggt cctggctgcc 
aaagccgact actcgggaat gtcgtcaggc 
agttcctttg tggcagtaac tgaggaaggc 



tcacttggcg ccgtcagcac tcgacttggg 60 
aatgatggca acatcttctt ttcccctgtg 120 
ctggggaccc gaggagccac cgcttcccag 180 
acgaagagct caagaataaa ggctgaagaa 24 0 
catcaacaat tccaaaagtt tttgactgaa 300 
aacataacca acaggctgtt tggagaaaaa 3 60 
tatgttgaaa aatattatca tgcatctctg 42 0 
gaaagtcgaa agaagattaa ttcctgggtt 480 
ttgttcccag atggctctat tagtagctct 54 0 
tttaaagggc aatgggacag ggagtttaag 600 
atgaataaga gcacaagtaa atctgtacag 660 
actttcctgg aggacttgca ggccaaaatt 72 0 
agcatgtttg tgcttctgcc caacgacatc 780 
agtcctgaga aattggtaga gtggactagt 84 0 
ctgcacttgc cccggtttga ggtggaggac 900 
atggggatgg gcgatgcctt cagtgagcac 960 
tccgggttgt acgcccagaa gttcctgcac 102 0 
accgaggctg cagctgccac tggcataggc 108 0 
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tttactgtca catccgcccc aggtcatgaa aatgttcact gcaatcatcc cttcctgttc 1140 
ttcatcaggc acaatgaatc caacagcatc ctcttcttcg gcagattttc ttctccttaa 12 00 
gatgatcgtt gccatggcat tgctgctttt agcaaaaaac aactaccagt gttactcata 12 60 
tgattatgaa aatcgtccat tcttttaaat ggtggctcac ttgcattt 1308 

<210> 110 
<211> 391 
<212> PRT 

<213> Homo sapiens 



<400> 110 



Met 


Asp 


Ser 


Leu 


Gly 


Ala 


Val 


Ser 


Thr 


Arg 


Leu 


Gly 


Phe 


Asp Leu 


Phe 


1 








5 










10 










15 




Lys 


Glu 


Leu 


Lys 


Lys 


Thr 


Asn 


Asp 


Gly 


Asn 


He 


Phe 


Phe 


Ser 


Pro 


Val 








20 










25 










30 






Gly 


lie 


Leu 


Thr 


Ala 


He 


Gly 


Met 


Val 


Leu 


Leu 


Gly 


Thr 


Arg Gly Ala 






35 










40 










45 








Thr 


Ala 


Ser 


Gin 


Leu 


Glu 


Glu 


Val 


Phe 


His 


Ser 


Glu 


Lys 


Glu 


Thr 


Lys 




50 










55 










60 










Ser 


Ser 


Arg 


He 


Lys 


Ala 


Glu 


Glu 


Lys 


Glu 


Val 


He 


Glu 


Asn 


Thr 


Glu 


65 










70 










75 










80 


Ala 


Val 


His 


Gin 


Gin 


Phe 


Gin 


Lys 


Phe 


Leu 


Thr 


Glu 


He 


Ser 


Lys 


Leu 










85 










90 










95 




Thr 


Asn 


Asp 


Tyr 


Glu 


Leu 


Asn 


He 


Thr 


Asn 


Arg 


Leu 


Phe 


Gly 


Glu 


Lys 








100 










105 










110 






Thr 


Tyr 


Leu 


Phe 


Leu 


Gin 


Lys 


Tyr 


Leu 


Asp 


Tyr 


Val 


Glu 


Lys 


Tyr 


Tyr 






115 










120 










125 








His 


Ala 


Ser 


Leu 


Glu 


Pro 


Val 


Asp 


Phe 


Val 


Asn 


Ala 


Ala 


Asp 


Glu 


Ser 




130 










135 










140 










Arg 


Lys 


Lys 


He 


Asn 


Ser 


Trp 


Val 


Glu 


Ser 


Lys 


Thr 


Asn 


Glu 


Lys 


He 


145 










150 










155 










160 


Lys 


Asp 


Leu 


Phe 


Pro 


Asp 


Gly 


Ser 


He 


Ser 


Ser 


Ser 


Thr 


Lys 


Leu 


Val 










165 










170 










175 




Leu 


Val 


Asn 


Met 


Val 


Tyr 


Phe 


Lys 


Gly 


Gin 


Trp 


Asp 


Arg 


Glu 


Phe 


Lys 








180 










185 










190 






Lys 


Glu 


Asn 


Thr 


Lys 


Glu 


Glu 


Lys 


Phe 


Trp 


Met 


Asn 


Lys 


Ser 


Thr 


Ser 






195 










200 










205 








Lys 


Ser 


Val 


Gin 


Met 


Met 


Thr 


Gin 


Ser 


His 


Ser 


Phe 


Ser 


Phe 


Thr 


Phe 




210 










215 










220 










Leu 


Glu 


Asp 


Leu 


Gin 


Ala 


Lys 


He 


Leu 


Gly 


He 


Pro 


Tyr 


Lys 


Asn 


Asn 


225 










230 










235 










240 


Asp 


Leu 


Ser 


Met 


Phe 


Val 


Leu 


Leu 


Pro 


Asn 


Asp 


He 


Asp 


Gly 


Leu 


Glu 










245 










250 










255 




Lys 


lie 


He 


Asp 


Lys 


He 


Ser 


Pro 


Glu 


Lys 




Val 


Glu 


Trp 


Thr 


Ser 








260 










265 










270 






Pro 


Gly 


His 


Met 


Glu 


Glu 


Arg 


Lys 


Val 


Asn 


Leu 


His 


Leu 


Pro Arg 


Phe 






275 










280 










285 








Glu 


Val 


Glu 


Asp 


Ser 


Tyr 


Asp 


Leu 


Glu 


Ala 


Val 


Leu 


Ala 


Ala 


Met 


Gly 




290 










295 










300 










Met 


Gly 


Asp 


Ala 


Phe 


Ser 


Glu 


His 


Lys 


Ala 


Asp 


Tyr 


Ser 


Gly Met 


Ser 


305 










310 










315 










320 


Ser 


Gly 


Ser 


Gly 


Leu 


Tyr 


Ala 


Gin 


Lys 


Phe 


Leu 


His 


Ser 


Ser 


Phe 


Val 










325 










330 










335 




Ala 


Val 


Thr 


Glu 


Glu 


Gly 


Thr 


Glu 


Ala 


Ala 


Ala 


Ala 


Thr 


Gly 


He 


Gly 








340 










345 










350 






Phe 


Thr 


Val 


Thr 


Ser 


Ala 


Pro 


Gly 


His 


Glu 


Asn 


Val 


His 


Cys 


Asn 


His 






355 










3 60 










365 








Pro 


Phe 


Leu 


Phe 


Phe 


He 


Arg 


His 


Asn 


Glu 


Ser 


Asn 


Ser 


He 


Leu 


Phe 
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370 375 
Phe Gly Arg Phe Ser Ser Pro 
385 390 



<210> 111 
<211> 1419 
<212> DNA 
<213> Homo sapiens 

<400> 111 
ggagaactat 
ccagccacca 
ggcgccgtca 
ggcaacatct 
acccgaggag 
agctcaagaa 
attgagaaca 
ctcactaatg 
ttccttcaaa 
gattttgtaa 
acaaatgaaa 
gtgctggtga 
actaaggaag 
cagagccatt 
ccatataaaa 
gagaagataa 
atggaagaaa 
ctagaggcgg 
tactcgggaa 
gtggcagtaa 
acatccgccc 
cacaatgaat 
tgccatggca 
aaatcgtcca 



<210> 112 


















<211> 400 


















<212> PRT 


















<213> Homo sapiens 














<400> 112 


















Met Asp Ser 


Leu 


Gly 


Ala 


Val 


Ser 


Thr 


Arg 


Leu Gly Phe Asp Leu Phe 


1 




5 










10 


15 


Lys Glu Leu 


Lys 


Lys 


Thr 


Asn 


Asp 


Gly 


Asn 


He Phe Phe Ser Pro Val 




20 










25 




30 


Gly He Leu 


Thr 


Ala 


He 


Gly 


Met 


Val 


Leu 


Leu Gly Thr Arg Gly Ala 


35 










40 






45 


Thr Ala Ser 


Gin 


Leu 


Glu 


Glu 


Val 


Phe 


His 


Ser Glu Lys Glu Thr Lys 


50 








55 








60 


Ser Ser Arg 


He 


Lys 


Ala 


Glu 


Glu 


Lys 


Glu 


Val Val Arg He Lys Ala 


65 






70 










75 80 


Glu Gly Lys 


Glu 


He 


Glu 


Asn 


Thr 


Glu 


Ala 


Val His Gin Gin Phe Gin 






85 










90 


95 


Lys Phe Leu 


Thr 


Glu 


He 


Ser 


Lys 


Leu 


Thr 


Asn Asp Tyr Glu Leu Asn 




100 










105 




110 


He Thr Asn 


Arg 


Leu 


Phe 


Gly 


Glu 


Lys 


Thr 


Tyr Leu Phe Leu Gin Lys 


115 










120 






125 



aaattaagga tcccagctac ttaattgact tatgcttcct agttcgttgc 60 
ccgtctctcc aaaaacccga ggtctcgcta aaatcatcat ggattcactt 12 0 
gcactcgact tgggtttgat cttttcaaag agctgaagaa aacaaatgat 180 
tcttttcccc tgtgggcatc ttgactgcaa ttggcatggt cctcctgggg 240 
ccaccgcttc ccagttggag gaggtgtttc actctgaaaa agagacgaag 300 
taaaggctga agaaaaagag gtggtaagaa taaaggctga aggaaaagag 360 
cagaagcagt acatcaacaa ttccaaaagt ttttgactga aataagcaaa 420 
attatgaact gaacataacc aacaggctgt ttggagaaaa aacatacctc 480 
aatacttaga ttatgttgaa aaatattatc atgcatctct ggaacctgtt 540 
atgcagccga tgaaagtcga aagaagatta attcctgggt tgaaagcaaa 600 
aaatcaagga cttgttccca gatggctcta ttagtagctc taccaagctg 660 
acatggttta ttttaaaggg caatgggaca gggagtttaa gaaagaaaat 720 
agaaattttg gatgaataag agcacaagta aatctgtaca gatgatgaca 780 
cctttagctt cactttcctg gaggacttgc aggccaaaat tctagggatt 84 0 
acaacgacct aagcatgttt gtgcttctgc ccaacgacat cgatggcctg 900 
tagataaaat aagtcctgag aaattggtag agtggactag tccagggcat 960 
gaaaggtgaa tctgcacttg ccccggtttg aggtggagga cagttacgat 1020 
tcctggctgc catggggatg ggcgatgcct tcagtgagca caaagccgac 1080 
tgtcgtcagg ctccgggttg tacgcccaga agttcctgca cagttccttt 1140 
ctgaggaagg caccgaggct gcagctgcca ctggcatagg ctttactgtc 1200 
caggtcatga aaatgttcac tgcaatcatc ccttcctgtt cttcatcagg 1260 
ccaacagcat cctcttcttc ggcagatttt cttctcctta agatgatcgt 1320 
ttgctgcttt tagcaaaaaa caactaccag tgttactcat atgattatga 138 0 
ttcttttaaa tggtggctca cttgcattt 1419 
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Tyr 


Leu 
130 


Asp 


Tyr 


Val 


Glu 


Lys 
135 


Tyr 


Tyr 


His 


Ala 


Ser 
140 


Leu 


Glu 


Pro 


Val 


Asp 


Phe 


Val 


Asn 


Ala 


Ala 


Asp 


Glu 


Ser 


Arg 


Lys 


Lys 


He 


Asn 


Ser 


Trp 


145 










150 










155 










160 


Val 


Glu 


Ser 




Thr 
165 


Asn 


Glu 


Lys 


He 


Lys 
170 


Asp 


Leu 


Phe 


Pro 


Asp 
175 


Gly 


Ser 


He 


Ser 


Ser 


Ser 


Thr 


Lys 


Leu 


Val 


Leu 


Val 


Asn 


Met 


Val 


Tyr 


Phe 








180 










185 










190 




Lys 


Gly 


Gin 


Trp Asp Arg 


Glu 


Phe 


Lys 


Lys 


Glu 


Asn 


Thr 


Lys 


Glu 


Glu 






195 










200 










205 








Lys 


Phe 
210 


Trp 


Met 


Asn 


Lys 


Ser 
215 


Thr 


Ser 


Lys 


Ser 


Val 
220 


Gin 


Met 


Met 


Thr 


Gin 


Ser 


His 


Ser 


Phe 


Ser 


Phe 


Thr 


Phe 


Leu 


Glu 


Asp 


Leu 


Gin 


Ala 


Lys 


225 










230 










235 








240 


lie 


Leu 


Gly 


He 


Pro 
245 


Tyr 


Lys 


Asn 


Asn 


Asp 
250 




Ser 


Met 


Phe 


Val 
255 


Leu 


Leu 


Pro 


Asn 


Asp 
260 


He 


Asp 


Gly 


Leu 


Glu 
265 


Lys 


He 


He 


Asp 


Lys 
270 


He 


Ser 


Pro 


Glu 


Lys 
275 


Leu 


Val 


Glu 


Trp 


Thr 
280 


Ser 


Pro 


Gly 


His 


Met 
285 


Glu 


Glu 


Arg 


Lys 


Val 
290 


Asn 


Leu 


His 


Leu 


Pro 
295 


Arg 


Phe 


Glu 


Val 


Glu 
300 


Asp 


Ser 


Tyr 


Asp 


Leu 


Glu 


Ala 


Val 


Leu 


Ala 


Ala 


Met 


Gly 


Met 


Gly 




Ala 


Phe 


Ser 


Glu 


305 










310 










315 










320 


His 


Lys 


Ala 


Asp 


Tyr 
325 


Ser 


Gly 


Met 


Ser 


Ser 
330 


Gly 


Ser 


Gly 


Leu 


Tyr 
335 


Ala 


Gin 


Lys 


Phe 


Leu 
340 


His 


Ser 


Ser 


Phe 


Val 
345 


Ala 


Val 


Thr 


Glu 


Glu 
350 


Gly 


Thr 


Glu 


Ala 


Ala 


Ala 


Ala 


Thr 


Gly 


He 


Gly 


Phe 


Thr 


Val 


Thr 


Ser 


Ala 


Pro 






355 










360 








365 








Gly His 


Glu 


Asn 


Val 


His 


Cys 


Asn 


His 


Pro 


Phe 


Leu 


Phe 


Phe 


lie 


Arg 




370 










375 










380 








His 


Asn 


Glu 


Ser 


As n 


Ser 


He 


Leu 


Phe 


Phe 


Gly 


Arg 


Phe 


Ser 


Ser 


Pro 


385 










390 










395 










400 



<210> 113 
<211> 957 
<212> DNA 

<213> Homo sapiens 
<400> 113 

ctcgaccttc tctgcacagc 
gactttctgc ttaattcagg 
gaaacatgag ttcttaccag 
agcaggtgaa acaacccagc 
agccatgcca ctcaaaggtt 
ccaaggtccc tgagccaggc 
agccaggatg taccaaggtc 
ccaaggtccc tgagccaggc 
agccaggtgc catcaaagtt 
caaaggtacc agagccatgt 
agcagaagta atttggtgca 
ccctcttccc atctgtttct 
caccccaagc catagtctct 
tgttcacaca cactctgaag 
cttttctggt cttcggctgc 
tttcctgctc tgccctcatt 



ggatgaaccc tgagcagctg 
agcttacagg attcttcaaa 
cagaagcaga cctttacccc 
cagcctccac ctcaggaaat 
ccacaacctg gaaacacaaa 
tgtaccaagg tccctgagcc 
cctgagccag gttgtaccaa 
agcatcaagg tccctgacca 
cctgagcaag gatacaccaa 
ccttcaacgg tcactccagg 
cagacaagcc cttgagaagc 
gtgtcttaat tgtctgtaga 
ctcttatttg tatcctaaaa 
aatcctgtaa gcccctgaat 
tcagggttca tctgaagatt 
aaattgcttt taattccaaa 



aagaccagaa aagccactat 60 
gagtgtgtcc agcatccttt 120 
accacctcag cttcaacagc 180 
atttgttccc acaaccaagg 240 
gattccagag ccaggctgta 300 
aggttgtacc aaggtccctg 360 
ggtccctgag ccaggctaca 42 0 
aggcttcatc aagtttcctg 480 
agttcctgtg ccaggctaca 540 
cccagctcag cagaagacca 600 
caaccaccag atgctggaca 660 
ccttgtaatc agtacattct 72 0 
atacggtact ataaagcttt 780 
taagcagaaa gtcttcatgg 8 40 
cgaatgaaaa gaaatgcatg 900 
aaaaaaaaaa aaaaaaa 957 
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<210> 114 
<211> 161 
<212> PRT 
<213> Homo sapiens 



<400> 114 



Met Ser 


Ser 


Tyr 


Gin 


Gin 


Lys 


Gin 


Thr Phe 


Thr Pro 


Pro 


Pro 


Gin 


Leu 


1 






5 








10 








15 




Gin Gin 


Gin 


Gin 


Val 


Lys 


Gin 


Pro 


Ser Gin 


Pro Pro 


Pro 


Gin 


Glu 


He 






20 








25 






30 






Phe Val 


Pro 


Thr 


Thr 


Lys 


Glu 


Pro 


Cys His 


Ser Lys 


Val 


Pro 


Gin 


Pro 




35 










40 






45 








Gly Asn 


Thr 


Lys 


He 


Pro Glu Pro Gly Cys 


Thr Lys 


Val 


Pro 


Glu 


Pro 


50 










55 






60 










Gly Cys 


Thr 


Lys 


Val 


Pro 


Glu 


Pro 


Gly Cys 


Thr Lys 


Val 


Pro 


Glu 


Pro 


65 








70 








75 








80 


Gly Cys 


Thr 


Lys 


Val 


Pro 


Glu 


Pro 


Gly Cys 


Thr Lys 


Val 


Pro 


Glu 


Pro 








85 








90 








95 




Gly Tyr 


Thr 


Lys 


Val 


Pro 


Glu 


Pro 


Gly Ser 


He Lys 


Val 


Pro 


Asp 


Gin 






100 










105 






110 






Gly Phe 


He 


Lys 


Phe 


Pro 


Glu 


Pro 


Gly Ala 


He Lys 


Val 


Pro 


Glu 


Gin 




115 










120 






125 








Gly Tyr 


Thr 


Lys 


Val 


Pro 


Val 


Pro 


Gly Tyr Thr Lys 


Val 


Pro 


Glu 


Pro 


130 










135 






140 










Cys Pro 


Ser 


Thr 


Val 


Thr 


Pro 


Gly 


Pro Ala 


Gin Gin 


Lys 


Thr Lys 


Gin 


145 








150 








155 








160 



<210> 115 
<211> 506 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 8, 21, 31, 32, 58, 75, 89, 96, 99, 103, 122, 126, 147, 150, 
158, 195, 210, 212, 219, 226, 246, 248, 249, 255, 258, 261, 
263, 265, 275, 304, 317, 321, 331, 337, 340, 358, 371, 377, 
380, 396, 450, 491 
<223> n = A,T,C or G 

<400> 115 

cattggtnct ttcatttgct ntggaagtgt nnatctctaa cagtggacaa agttcccngt 60 
gccttaaact ctgtnacact tttgggaant gaaaanttng tantatgata ggttattctg 12 0 
angtanagat gttctggata ccattanatn tgcccccngt gtcagaggct catattgtgt 18 0 
tatgtaaatg gtatntcatt cgctactatn antcaattng aaatanggtc tttgggttat 240 
gaatantnng cagcncanct nanangctgt ctgtngtatt cattgtggtc atagcacctc 300 
acancattgt aacctcnatc nagtgagaca nactagnaan ttcctagtga tggctcanga 360 
ttccaaatgg nctcatntcn aatgtttaaa agttanttaa 'gtgtaagaaa tacagactgg 42 0 
atgttccacc aactagtacc tgtaatgacn ggcctgtccc aacacatctc ccttttccat 480 
gactgtggta ncccgcatcg gaaaaa 50 6 

<210> 116 
<211> 3079 
<212> DNA 
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<213> Homo sapiens 
<400> 116 

ggatccccgg gtttcctaaa ccccccacag 
ggtcaaaggg cagaaaaaat gctgagttag 
aaagaggtca aagtggttta tagggggcgc 
cttgcaggca gatctgccca gtgggctctg 
tgtgcacaaa aggatgaaac tctattttcc 
cagattgcct ttcccagagg gaaaaccctg 
gcagatcact ggggaatcgt ttgccccccg 
caggtgctca gcatgtaccg tactgggatg 
ccaggacact gccatgccaa tgccccctca 
gccccagcct ctatggtgaa gacatacttg 
atcagtgctc gaaggcaagg ttatttctaa 
tgcaccccac accactgtgc aggtgtgacc 
ccagcccact taatcatcac agctcgacag 
taaaaagggg catcaccgtt cctgggtaac 
gttctctcca gcacctccca acccactagt 
ccaccatgtc tcgccagtca agtgtgtctt 
ccgcctctgc catcaccccg tctgtctccc 
ggggtggcgg tggtggtggc ttcggcaggg 
gctatggcag ccggagcctc tacaacctgg 
gtggtggcag cttcaggaac cggtttggtg 
gtggtgccgg tagtggattt ggtttcggcg 
gcggagctgg ctttggaggt ggcttcggtg 
gtatccaaga ggtcactgtc aaccagagtc 
ccagcatcca gagggtgagg accgaggagc 
ttgcctcctt catcgacaag gtgcggttcc 
agtggaccct gctgcaggag cagggcacca 
tcgagcagta catcaacaac ctcaggaggc 
gcctggactc agagctgaga aacatgcagg 
aggatgaaat caacaagcgt accactgctg 
tagatgctgc ctacatgaac aaggtggagc 
agattaactt catgaagatg ttctttgatg 
ctgacacctc agtggtcctc tccatggaca 
tcgctgaggt caaggcccag tatgaggaga 
cctggtatca gaccaagtat gaggagctgc 
tccgcaacac caagcatgag atctctgaga 
agattgacaa tgtcaagaaa cagtgcgcca 
agcgtgggga gctggccctc aaggatgcca 
tgcagaaggc caagcaggac atggcccggc 
ccaagctggc cctggacgtg gagatcgcca 
gcagactcag tggagaagga gttggaccag 
cctctggata tggcagtggc agtggctatg 
gcctcggtgg aggtcttgcc ggaggtagca 
gtgtcggcct aggtggtggg ctcagtgtgg 
gagggctggg ggtgggcttt ggcagtggcg 
ccaccacctc ctcctcccgg aagagcttca 
ttccaagtgc agcaacccag cccatggaga 
gttttatcct tttctggaga gtagtctaga 
ttcccaggag agccccattc ccagcccctg 
tcaaatcagc cttcaggttt cccacagcat 
tcccaaatct aaatcatcaa aacagaatcc 
taactacctc cagaatgtgt tcaataaaat 
gttttttttt tctacccaa 



agtcctgccc aggccaaaga gcaaggaaaa 60 
gaggagctat ggaaggataa acctggcctt 12 0 
tgagggcttc ccacattctc tggcctaaac 18 0 
ggatagctgt gccttcccta acaaaaaaat 24 0 
ctctagcaca taaccaagaa tataaggcta 300 
cagcaacctg ctgcctggaa aagtgtaaga 3 60 
ctgatggaca gcttccccaa gctccaaggg 420 
gttgtcaata ctcctggtcc tgtaagagtc 480 
gttcctggca tcctttttgg gctgctcaca 540 
ctagcagcgt caccaacttg ttgccaagag 600 
ctgagcagag cctgccagga agaaagcgtt 660 
ggtgagctca cagctgcccc ccaggcatgc 72 0 
ctctctcgcc cagcccagtt ctggaaggga 78 0 
agagccacct tctgcgtcct gctgagctct 840 
gcctggttct cttgctccac caggaacaag 900 
ccggagcggg gggcagtcgt agcttcagca 960 
gcaccagctt cacctccgtg tcccggtccg 1020 
tcagccttgc gggtgcttgt ggagtgggtg 1080 
ggggctccaa gaggatatcc atcagcacta 1140 
ctggtgctgg aggcggctat ggctttggag 12 00 
gtggagctgg tggtggcttt gggctcggtg 12 60 
gccctggctt tcctgtctgc cctcctggag 1320 
tcctgactcc cctcaacctg caaatcgacc 1380 
gcgagcagat caagaccctc aacaataagt 14 40 
tggagcagca gaacaaggtt ctggaaacaa 1500 
agactgtgag gcagaacctg gagccgttgt 15 60 
agctggacag catcgtgggg gaacggggcc 1620 
acctggtgga agacttcaag aacaagtatg 1680 
agaatgagtt tgtgatgctg aagaaggatg 1740 
tggaggccaa ggttgatgca ctgatggatg 18 00 
cggagctgtc ccagatgcag acgcatgtct 18 60 
acaaccgcaa cctggacctg gatagcatca 1920 
ttgccaaccg cagccggaca gaagccgagt 1980 
agcagacagc tggccggcat ggcgatgacc 2040 
tgaaccggat gatccagagg ctgagagccg 2100 
atctgcagaa cgccattgcg gatgccgagc 2160 
ggaacaagct ggccgagctg gaggaggccc 222 0 
tgctgcgtga gtaccaggag ctcatgaaca 22 80 
cttaccgcaa gctgctggag ggcgaggaat 23 4 0 
tcaacatctc tgttgtcaca agcagtgttt 2400 
gcggtggcct cggtggaggt cttggcggcg 24 60 
gtggaagcta ctactccagc agcagtgggg 2520 
ggggctctgg cttcagtgca agcagtagcc '2580 
ggggtagcag ctccagcgtc aaatttgtct 2640 
agagctaaga acctgctgca agtcactgcc 2700 
ttgcctcttc taggcagttg ctcaagccat 27 60 
ccaagccaat tgcagaacca cattctttgg 2820 
gtctcccgtg ccgcagttct atattctgct 2880 
ggcccctgct gacacgagaa cccaaagttt 2940 
ccaccccaat cccaaatttt gttttggttc 3000 
gttttataat ataagctggt gtgcagaatt 3060 
3079 



<210> 117 
<211> 6921 
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<212> DNA 

<213> Homo sapiens 

<400> 117 

gaattctgac tgtccactca aaacttctat tccgatcaaa gctatctgtg actacagaca 60 
aattgagata accatttaca aagacgatga atgtgttttg gcgaataact ctcatcgtgc 12 0 
taaatggaag gtcattagtc ctactgggaa tgaggctatg gtcccatctg tgtgcttcac 180 
cgttcctcca ccaaacaaag aagcggtgga ccttgccaac agaattgagc aacagtatca 240 
gaatgtcctg actctttggc atgagtctca cataaacatg aagagtgtag tatcctggca 300 
ttatctcatc aatgaaattg atagaattcg agctagcaat gtggcttcaa taaagacaat 360 
gctacctggt gaacatcagc aagttctaag taatctacaa tctcgttttg aagattttct 420 
ggaagatagc caggaatccc aagtcttttc aggctcagat ataacacaac tggaaaagga 480 
ggttaatgta tgtaagcagt attatcaaga acttcttaaa tctgcagaaa gagaggagca 54 0 
agaggaatca gtttataatc tctacatctc tgaagttcga aacattagac ttcggttaga 600 
gaactgtgaa gatcggctga ttagacagat tcgaactccc ctggaaagag atgatttgca 660 
tgaaagtgtg ttcagaatca cagaacagga gaaactaaag aaagagctgg aacgacttaa 72 0 
agatgatttg ggaacaatca caaataagtg tgaggagttt ttcagtcaag cagcagcctc 780 
ttcatcagtc cctaccctac gatcagagct taatgtggtc cttcagaaca tgaaccaagt 840 
ctattctatg tcttccactt acatagataa gttgaaaact gttaacttgg tgttaaaaaa 900 
cactcaagct gcagaagccc tcgtaaaact ctatgaaact aaactgtgtg aagaagaagc 960 
agttatagct gacaagaata atattgagaa tctaataagt actttaaagc aatggagatc 1020 
tgaagtagat gaaaagagac aggtattcca tgccttagag gatgagttgc agaaagctaa 1080 
agccatcagt gatgaaatgt ttaaaacgta taaagaacgg gaccttgatt ttgactggca 1140 
caaagaaaaa gcagatcaat tagttgaaag gtggcaaaat gttcatgtgc agattgacaa 1200 
caggttacgg gacttagagg gcattggcaa atcactgaag tactacagag acacttacca 1260 
tcctttagat gattggatcc agcaggttga aactactcag agaaagattc aggaaaatca 1320 
gcctgaaaat agtaaaaccc tagccacaca gttgaatcaa cagaagatgc tggtgtccga 1380 
aatagaaatg aaacagagca aaatggacga gtgtcaaaaa tatgcagaac agtactcagc 14 4 0 
tacagtgaag gactatgaat tacaaacaat gacctaccgg gccatggtag attcacaaca 1500 
aaaatctcca gtgaaacgcc gaagaatgca gagttcagca gatctcatta ttcaagagtt 1560 
catggaccta aggactcgat atactgccct ggtcactctc atgacacaat atattaaatt 1620 
tgctggtgat tcattgaaga ggctggaaga ggaggagatt aaaaggtgta aggagacttc 1680 
tgaacatggg gcatattcag atctgcttca gcgtcagaag gcaacagtgc ttgagaatag 17 40 
caaacttaca ggaaagataa gtgagttgga aagaatggta gctgaactaa agaaacaaaa 1800 
gtcccgagta gaggaagaac ttccgaaggt cagggaggct gcagaaaatg aattgagaaa 18 60 
gcagcagaga aatgtagaag atatctctct gcagaagata agggctgaaa gtgaagccaa 1920 
gcagtaccgc agggaacttg aaaccattgt gagagagaag gaagccgctg aaagagaact 1980 
ggagcgggtg aggcagctca ccatagaggc cgaggctaaa agagctgccg tggaagagaa 2 04 0 
cctcctgaat tttcgcaatc agttggagga aaacaccttt accagacgaa cactggaaga 2100 
tcatcttaaa agaaaagatt taagtctcaa tgatttggag caacaaaaaa ataaattaat 2160 
ggaagaatta agaagaaaga gagacaatga ggaagaactc ttgaagctga taaagcagat 2220 
ggaaaaagac cttgcatttc agaaacaggt agcagagaaa cagttgaaag aaaagcagaa 2280 
aattgaattg gaagcaagaa gaaaaataac tgaaattcag tatacatgta gagaaaatgc 2340 
attgccagtg tgtccgatca cacaggctac atcatgcagg gcagtaacgg gtctccagca 2400 
agaacatgac aagcagaaag cagaagaact caaacagcag gtagatgaac taacagctgc 24 60 
caatagaaag gctgaacaag acatgagaga gctgacatat gaacttaatg ccctccagct 252 0 
tgaaaaaacg tcatctgagg aaaaggctcg tttgctaaaa gataaactag atgaaacaaa 2580 
taatacactc agatgcctta agttggagct ggaaaggaag gatcaggcgg agaaagggta 2640 
ttctcaacaa ctcagagagc ttggtaggca attgaatcaa accacaggta aagctgaaga 2700 
agccatgcaa gaagctagtg atctcaagaa aataaagcgc aattatcagt tagaattaga 27 60 
atctcttaat catgaaaaag ggaaactaca aagagaagta gacagaatca caagggcaca 2820 
tgctgtagct gagaagaata ttcagcattt aaattcacaa attcattctt ttcgagatga 2880 
gaaagaatta gaaagactac aaatctgcca gagaaaatca gatcatctaa aagaacaatt 2940 
tgagaaaagc catgagcagt tgcttcaaaa tatcaaagct gaaaaagaaa ataatgataa 3000 
aatccaaagg ctcaatgaag aattggagaa aagtaatgag tgtgcagaga tgctaaaaca 3060 
aaaagtagag gagcttacta ggcagaataa tgaaaccaaa ttaatgatgc agagaattca 312 0 
ggcagaatca gagaatatag ttttagagaa acaaactatc cagcaaagat gtgaagcact 3180 
gaaaattcag gcagatggtt ttaaagatca gctacgcagc acaaatgaac acttgcataa 3240 
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acagacaaaa acagagcagg attttcaaag 
gaaaagtcaa aatttggtaa gtgaatttaa 
ccagaatacc aagaaagaag ttagaaatct 
gaagcgacgc ggggagcaga aagttcagct 
caggttgaaa aaagtacaag acgaattaca 
cagaaagatg gttctgtttc aggaagaatc 
tcggaagaag atggaaaaat taatggagtc 
cattaggctt gactttgtgt ctcttcaaca 
gctttgtgaa acaaacatta aagaacttga 
gcagcaaggg cagcacatgg aagcaaatca 
gctgatagcc cagaagcgtg aggttgaaaa 
agagcatgaa catcaattag ttttgctcca 
agactgtacc ttcaaaccag attttgagat 
gctgtcctct agaaacactg gacaccttca 
gactcaagaa ccacagccat tggaagagaa 
caaagaagtc caattccagc caccaggggc 
ttactctgag tacttttctc agacaagcac 
ccccattaca agactgtctg aaattgagaa 
accacctgtt aggtatcaag ataacgcatg 
cttagagata gctaagaaca agcagtatga 
agaaaagaac ccagttccca gtgctgaaga 
tggactcaag aaaggggatt tccttaagaa 
tgatggtgat catgcatgtt cagtcaggga 
cactgtgact gccaggcagt tggtggaagc 
gctgcgactc ggtcttaaga ctgttgaaga 
gaaagccacc tcaattgcag ggctttacct 
ctcagcggcc gagagaatca taatagacaa 
ggctgcaaca ggttttataa ttgatcccat 
agttcttaaa ggagttgttg accccgaatt 
agctgtggga tattcttatt cttctaagac 
aatgcttgac agacaaaaag gtaaacatat 
cattgaccct gtgagaggca ttcgtgttcc 
gaataatgcc atcttacagt ttttacatga 
tcccaataac aagcaagctc tgtattactc 
agagtcccaa tgctttctgt ttccatttgg 
gaaaacacat agaatttctg tagtagatac 
ggctttccag agaaacctga ttgagaaaag 
tcagtggaag gaagctatgt tttttgaatc 
tactaaaaca ggattacact tcaatattaa 
agccttggtc aaaaagtatc aggaaggcct 
gctgagccgg ttagtcccca agaaagattt 
tgctagtggg gaaaggatct ctgtactaaa 
tactgccctc cgatgccttg aagcccaagt 
tggcaaaaag taccgggtgg ccgaagcttt 
ccagcagctg cgacagtgtg aattagtaat 
aatgatgtca gtggtggaag ctgtgaatgc 
atgtttggaa tttcagtact tgacaggagg 
atcaatagaa gaggctctcc aagtaggtat 
agatcaaaag tcatatgtca gaaatataat 
taaagaagcc ttagaaaaag ctgattttga 
atctgagccc ctgatgacag gaatttctag 
taaataactg tgcaaggggt gatgcaggct 
tatcggctac atatgcagtc tgtgaattat 
attgctaagt gctcaaaata gagtaagttt 
cttcaaatgg tttcatttag ccttgagaat 
tttttttttt tttacgtaga atgtgggata 
tttcttcaga actccccttc attgaatagt 
ctgaaagagc acgtcatgaa gcaccatgga 



aaaaattaaa tgcctagaag aagacctggc 3300 

gcaaaagtgt gaccaacaga acattatcat 3360 

gaatgcggaa ctgaatgctt ccaaagaaga 342 0 

acaacaagct caggtgcaag agttaaataa 3480 

cttaaagacc atagaggagc agatgaccca 3540 

tggtaaattc aaacaatcag cagaggagtt 3600 

caaagtcatc actgaaaatg atatttcagg 3660 

agaaaactct agagcccaag aaaatgctaa 372 0 

aagacagctt caacagtatc gtgaacaaat 37 80 

ttaccaaaaa tgtcagaaac ttgaggatga 38 4 0 

cctgaagcaa aaaatggacc aacagatcaa 3900 

gtgtgaaatt caaaaaaaga gcacagccaa 3960 

gacagtgaag gagtgccagc actctggaga 4020 

cccaacaccc agatcccctc tgttgagatg 4080 

gtggcagcat cgggttgttg aacagatacc 4140 

tccactcgag aaagagaaaa gccagcagtg 4200 

cgagttacag ataacttttg atgagacaaa 4260 

gataagagac caagccctga acaattctag 4320 

tgaaatggaa ctggtgaagg ttttgacacc 4380 

tatgcataca gaagtcacaa cattaaaaca 4440 

atggatgctt gaagggtgca gagcatctgg 4500 

gggcttagaa ccagagacct tccagaactt 4560 

tgatgaattt aaattccaag ggcttaggca 4 620 

taagcttctg gacatgagaa caattgagca 4680 

agttcagaaa actcttaaca agtttctgac 47 40 

agaatctaca aaagaaaaga tttcatttgc 4800 

aatggtggct ttggcatttt tagaagctca 4 8 60 

ttcaggtcag acatattctg ttgaagatgc 4920 

cagaattagg cttcttgagg cagagaaggc 4980 

attgtcagtg tttcaagcta tggaaaatag 5040 

'cttggaagcc cagattgcca gtgggggtgt 5100 

tccagaaatt gctctgcagc aggggttgtt 5160 

gccatccagc aacacaagag ttttccctaa 5220 

agaattactg cgaatgtgtg tatttgatgt 5280 

ggagaggaac atttccaatc tcaatgtcaa 5340 

taaaacagga tcagaattga ccgtgtatga 5400 

tatatatctt gaactttcag ggcagcaata 54 60 

ctatgggcat tcttctcata tgctgactga 5520 

tgaggctata gagcagggaa caattgacaa 5580 

catcacactt acagaacttg ctgattcttt 5640 

gcacagtcct gttgcagggt attggctgac 57 00 

agcctcccgt agaaatttgg ttgatcggat 57 60 

cagtacaggg ggcataattg atcctcttac 5820 

gcatagaggc ctggttgatg aggggtttgc 5880 

cacagggatt ggccatccca tcactaacaa 5940 

aaatattata aataaggaaa tgggaatccg 6000 

gttgatagag ccacaggttc actctcggtt 6060 

tatagatgtc ctcattgcca caaaactcaa 6120 

atgccctcag acaaaaagaa agttgacata 6180 

tttccacaca ggacttaaac tgttagaagt 6240 

cctctactat tcttcctaat gggacatgtt 6300 

ggttcatgcc actttttcag agtatgatga 6360 

gtaacatact ctatttcttg agggctgcaa 6420 

taaattgaaa attacataag atttaatgcc 64 8 0 

ggttttttga aacttggcca cactaaaatg 6540 

aacttgatga actccaagtt cacagtgtca 6600 

gatcatttat taaatgataa attgcactcg 6660 

atcaaagaga aagatataaa ttcgttccca 6720 
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cagccttcaa gctgcagtgt tttagattgc 
gatatagtga ccttctttgc atattaaaat 
tcttcgcact tgaaagctaa cattatgaat 
ttcattctgt gtattttccg g 



ttcaaaaaat gaaaaagttt tgcctttttc 67 8 0 
gtttaccaca atgtcccatt tctagttaag 68 4 0 
attatgtgtt ggaggagggg aaggattttc 6900 
6921 



<210> 118 
<211> 946 
<212> DNA 
<213> Homo sapiens 



<400> 118 

cttctgactg ggctcaggct gacaggtaga 
ctccccatca cagctgtggt gcagtccacc 
gtcggcagtg gcttaggcct gggtggagga 
gttggaggtg gcttcagttc cagcagtggc 
ggaggcggca gttccaccat caagtacacc 
aagcactaaa gtgcgtctgc tagctctcgg 
cagagccctc tcctcaggtt gcctgtcctc 
ggtagagctg gggatgaatg cttagtgccc 
gagcacccat tgctcaccat cagatcaacc 
tggagcttca ctgttactaa attattaatt 
gagcattata agaaaatgac ctctgctcct 
ttcagaacaa cttccactta ctttccactg 
tgaaccccca cccaggcagt atccatgaaa 
gcctgtatct ctgtgatgat ttctgtgctc 
atttataata catatattct tttactttgc 
acttttttat ctgataagtg aatagttgtt 

<210> 119 
<211> 8948 
<212> DNA 
<213> Homo sapiens 



gctcaccatg gcttcttgtg tccttgtccc 60 
gtctccagtg gctatggcgg tgccagtggt 12 0 
agcagctact cctatggcag tggtcttggc 18 0 
agagccattg ggggtggcct cagctctgtt 24 0 
accacctcct cctccagcag gaagagctat 300 
tcccacagtc ctcaggcccc tctctggctg 360 
tcctggcctc cagtctcccc tgctgtccca 420 
tcacttcttc tctctctctc tataccatct 480 
tctgatttta catcatgatg taatcaccac 540 
tcttgcctcc agtgttctat ctctgaggct 600 
tttcattgca gaaaattgcc aggggcttat 660 
gctctcaaac tctctaactt ataagtgttg 720 
gcacaagtga ctagtcctat gatgtacaaa 780 
ttcactgttt gcaattgcta aataaagcag 840 
cttgctttgg ggccaaagtt ttgggcttaa 900 
tttaaaagat aatcta 94 6 



<400> 119 

tcaacagccc ctgctccttg ggcccctcca 
acaccaacac ccagctccga cgcagctcct 
tttcctcccg ctcctgcccc cggcccgtcg 
ggcccaggta gcgagcagcg acctcgcgag 
gtccgcctat ccttggcccc ctccgctttc 
gcgctgagcc gctctcccga ttgcccgccg 
ggatcaacac tctgggccgc atgatccgcg 
tgaccagcgg cggcgggggc accagcagga 
accagaactc ggacggctac tgtcaaaccg 
ccatccagga gctgctgcag aactgctccg 
agcctgaatt gaagtatgga gatggaatac 
gttttgccca ggccaatgac caaatggaaa 
agatgggcca gccctgtgat gcttaccaga 
gagcccttta taaagccatc agtgtccctc 
gaggctacac ttgtcagagt ggctctggct 
aatgtttggg gtggatgagg cagcaaaggg 
acctggcctc agtggagcag cacattaaca 
actatcgctg gcagctggac aaaatcaaag 
agttggagga ggagtatgaa aacctgctga 
gacagctgca gaacatcatt caggccacgt 
aggaggagga gctgctgtac gactggagcg 
aggccttctc catacgcatg agtcaactgg 



tgccatgccg taatctctcc cacccgacca 60 
ctgcgccctt gccgccctcc gagccacagc 120 
ccgtctccgc gctcgcagcg gcctcgggag 180 
ccttccgcac tcccgcccgg ttccccggcc 240 
tccgcgccgg cccgcctcgc ttatgcctcg 300 
acatgagctg caacggaggc tcccacccgc 3 60 
ccgagtctgg cccggacctg cgctacgagg 420 
tgtactattc tcggcgcggc gtgatcaccg 480 
gcacgatgtc caggcaccag aaccagaaca 54 0 
actgcttgat gcgagcagag ctcatcgtgc 600 
aactgactcg gagtcgagaa ttggatgagt 660 
tcctcgacag cttgatcaga gagatgcggc 72 0 
aaaggcttct tcagctccaa gagcaaatgc 780 
gagtccgcag ggccagctcc aagggtggtg 840 
gggatgagtt caccaaacat gtcaccagtg 900 
cggagatgga catggtggcc tggggtgtgg 960 
gccaccgggg catccacaac tccatcggcg 1020 
ccgacctgcg cgagaaatct gcgatctacc 1080 
aagcgtcctt tgagaggatg gatcacctgc 1140 
ccagggagat catgtggatc aatgactgcg 1200 
acaagaacac caacatcgct cagaaacagg 12 60 
aagttaaaga aaaagagctc aataagctga 1320 
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aacaagaaag tgaccaactt gtcctcaatc 
atatggacac tctgcagacg cagtggagtt 
ttcatctgaa agaaaatgct gcctactttc 
catacctgaa ggggctccag gactccatca 
ccctgcagca cctgctggaa cagatcaagg 
aatacaagcg tcaggtgcag aacttggtaa 
ctcgtaaccc agactacaga agcaataaac 
aacaagatca gaaaatcgtg cataaggggg 
gcagcaagtg gtacgtgacg ggcccgggag 
tgatcatccc tcctccgaac ccactggccg 
acgaagccat cttggctctg tggaaccagc 
ggcactactg catgattgac atagagaaga 
caatgcggca ggaagattac atgaagacga 
tcatcagaaa tagccaaggc tcagagatgt 
ctcagttcac cgatgcccag aagcattacc 
cccagcacca gacagtgacc acaactgaaa 
accataataa agtaattgaa accaacagag 
tggagctgca gaagattcgc aggcagatag 
acctccctct agcagaccag gggtcttctc 
agagtgtgca gaatgattca caagcaattg 
ttgccaactt cagaggttct gaaaagtact 
ttcagaaact ggaaaatatc aatggtgtta 
taagggcact gctccaggct attctccaaa 
ggctcactga ggaggaaact gtctgcctgg 
gactgaagaa aataaaaaat gacttgaact 
cagaactaca gaaagcccag cagatccact 
atctggactt gggcaagttc ggtgaaaaag 
tagataaaca gatcgacttt agattatggg 
attatcgtga taactatcag gctttctgca 
attccttaga atccatgaaa tttggagatt 
agaagaactt gcacagtgaa atatctggca 
ttgctgaact ttgcgccaat tcaattaagg 
caggactgga aactctgctg aacataccta 
gggtgattct gcaagaggct gcagatgttc 
ctggagacta ttacaggttc ttaagtgaga 
aaaataccaa gatcgaagtt ttggaagagg 
aaaactgtaa taagaacaaa ttcctggatc 
cccagttcaa agcgaagctt gcgagcctgg 
ggaagtcggc taagcaaaat ctagacaagt 
agatcacccg actgacttat gagattgaag 
acagatttga ccaacagaag aatgactatg 
aggagaacct tggttggcag aaattagagt 
agattgaaag gttgagggtt ctactgcagg 
atgagctggc aaaggtaaga aaccactata 
atgaaacaga gattaacatt acgaagacca 
atgattccaa aaatcttaga aaccagcttg 
aggatgaaat tgtcaggctc aatgacagca 
ctgaagaaaa cgcccttcag caaaaggcct 
atctggagat agaactgaag caggtcatgc 
agcagtccct ggaggaggct gccaagacca 
tcaaagctga gtttcaggag gaggccaagc 
aggtaagaaa caattatgat gaggagatca 
tcaacatcac caagaccacc atccaccagc 
gctaccgggc tcagatagac aatctcaccc 
agaggctgaa gaacactcta acccagacca 
tccaacagca aaaggccact ggctctgagg 
agctgagaca agtcactcag atgcgaacag 
atgatgctgc caaaaccatc caggataaaa 



agcatccagc ttcagacaaa attgaggcct 1380 
ggattcttca gatcaccaag tgcattgatg 1440 
agttttttga agaggcgcag tctactgaag 15 00 
ggaagaagta cccctgcgac aagaacatgc 15 60 
agctggagaa agaacgagag aaaatccttg 1620 
acaagtctaa gaagattgta cagctgaagc 1680 
ccattattct cagagctctc tgtgactaca 17 4 0 
atgagtgtat cctgaaggac aacaacgagc 18 00 
gcgttgacat gcttgttccc tctgtggggc 18 60 
tggacctctc ttgcaagatt gagcagtact 1920 
tctacatcaa catgaagagc ctggtgtcct 1980 
tcagggccat gacaatcgcc aagctgaaaa 2 040 
tagccgacct tgagttacat taccaagagt 2100 
ttggagatga tgacaagcgg aaaatacagt 2160 
agaccctggt cattcagctc cctggctatc 2220 
tcactcatca tggaacctgc caagatgtca 22 80 
aaaatgacaa gcaagaaaca tggatgctga 2340 
agcactgcga gggcaggatg actctcaaaa 2 4 00 
accacatcac agtgaaaatt aacgagctta 24 60 
ctgaggttct caaccagctt aaagatatgc 2520 
gctatttaca gaatgaagta tttggactat 2580 
cagatggcta cttaaatagc ttatgcacag 2 640 
cagaagacat gttaaaggtt tatgaagcca 27 00 
acctggataa agtggaagct taccgctgtg 27 60 
tgaagaagtc gttgttggcc actatgaaga 2820 
ctcagacttc acagcagtat ccactttatg 2880 
tcacacagct gacagaccgc tggcaaagga 2940 
acctggagaa acaaatcaag caattgagga 3000 
agtggctcta tgatcgtaaa cgccgccagg 30 60 
ccaacacagt catgcggttt ttgaatgagc 3120 
aacgagacaa atcagaggaa gtacaaaaaa 3180 
attatgagct ccagctggcc tcatacacct 32 4 0 
tcaagaggac catgattcag tccccttctg 3300 
atgctcggta cattgaacta cttacaagat 3360 
tgctgaagag tttggaagat ctgaagctga 3420 
agctcagact ggcccgagat gccaactcgg 3480 
agaacctgca gaaataccag gcagagtgtt 3540 
aggagctgaa gagacaggct gagctggatg 3600 
gctacggcca aataaaagaa ctcaatgaga 3 6 60 
atgaaaagag aagaagaaaa tctgtggaag 3720 
accaactgca gaaagcaagg caatgtgaaa 37 80 
ctgagaaagc catcaaggag aaggagtacg 3840 
aagaaggcac ccggaagaga gaatatgaaa 3900 
atgaggagat gagtaattta aggaacaagt 3960 
ccatcaagga gatatccatg caaaaagagg 4020 
atagactttc aagggaaaat cgagatctga 4 080 
tcttgcaggc cactgagcag cgaaggcgag 4140 
gtggctctga gataatgcag aagaagcagc 42 00 
agcagcgctc tgaggacaat gcccggcaca 42 60 
ttcaggacaa aaataaggag atcgagagac 4320 
gccgctggga atatgaaaat gaactgagta 43 8 0 
ttagcttaaa aaatcagttt gagaccgaga 44 40 
tcaccatgca gaaggaagag gataccagtg 4500 
gagaaaacag gagcttatct gaagaaataa 4560 
cagagaatct caggagggtg gaagaagaca 4 62 0 
tgtctcagag gaaacagcag ctggaggttg 4 680 
aggagagcgt aagatataag caatctcttg 47 40 
acaaggagat agaaaggtta aaacaactga 4 8 00 
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tcgacaaaga aacaaatgac cggaaatgcc 
tccagtatga cctgcagaaa gcaaacagta 
ttcaggagca agaactgaca cgcctgagga 
ctgtgaagga ccaggatatc acgcggttcc 
agcagaaggt ggaagaggag ctgaatcggc 
agaggaagaa gctggaggaa gagctggaag 
tcaaaatcac caacctgacc cagcagctgg 
aggatgacct ccggcagcag agggacgtgc 
cccaggaaga gctgaggagg ctctcttctg 
aggaacagga aagtgtcaaa caagctcact 
aagataaaag cagaagctta aatgaaagca 
cagagaacct gaccaaggag cacttgatgt 
agtacgatga cctgaggaga ggacgaagcg 
tggaactaag gagccagctg cagatcagca 
ttaatgattt acagagagag agggaaaatt 
aggctttaga ggcatctaat aggattcagg 
aggaaagaga gagccttctg gtgaaaatca 
agaggctgga ggatgagctg aatcgtgcaa 
aacagcgcct ggagtgtgag aaacagcaaa 
aatattcccg caaggaggag gctattagga 
gagagaagaa cagtcttagg agtgagatcg 
aagagaggtg caggcgtaag ctggaggatt 
cagaacgctc ccgatatcag agggagattg 
atcgagagac ccagactgag tgtgagtgga 
ggctgaggaa gaaggtgaca gcaatgcagc 
ccttggacaa actattgaag gggaagaagt 
cattccttcg gggtgcagga tctatcgctg 
ctttggtaga ggccaagaga aagaaattaa 
aggcccaggc agctacaggt ggtataattg 
acagtgccat agctcgggac ctcattgact 
aaaaagctat cactggtttt gatgatccat 
ccatcaagaa aaatttgatt gatagagaaa 
cttcaggggg tgtagtagac cctgtgaaca 
cccgggggct gattgataga gatttgtatc 
aaaactttgt ggatccagtc accaaaaaga 
gcagaatcga accacatact ggtctgctct 
tccaaggaat cagacaacct gtgaccgtca 
cgtccactgt caatgaactg gaatctggtc 
ttaaggactt cctccagggt tcaagctgca 
agaagcttgg catttatgag gccatgaaaa 
agttgctgga agcccaagca gctactggct 
taccagtgga ggaagcctac aagagaggtc 
tgtctgcaga acgagctgtc actgggtata 
tgttccaagc catgaataag gaactcatcg 
cacagatcgc aaccgggggg atcattgacc 
tagcatataa gaggggctat ttcaatgagg 
atgataccaa aggatttttt gaccccaaca 
aagaaagatg cattaaggat gaggaaacag 
agaaacaggt gcagacatca caaaagaata 
acccagaaac caataaagaa atgtctgttc 
atgaaacctt caaagaactg tgtgagcagg 
gatcagatgg ctccaccagg gtggtcctgg 
ttcaagatgc tattgacaag ggccttgttg 
gcagcctcag cctcactcaa tttgctgaca 
gcagcagcat gggcagtggt gtcagcgatg 
taagtaagat ttccaccata tccagcgtca 
cagacaccct ggaagaatcg agccccattg 
aaatctccat tacagaaggt atagagcggg 



tggaagatga aaacgcgaga ttacaaaggg 48 60 
gtgcgacgga gacaataaac aaactgaagg 4920 
tcgactatga aagggtttcc caggagagga 4980 
agaactctct gaaagagctg cagctgcaga 5040 
tgaagaggac cgcgtcagaa gactcctgca 5100 
gcatgaggag gtcgctgaag gagcaagcca 5160 
agcaggcatc cattgttaag aagaggagtg 5220 
tggatggcca cctgagggaa aagcagagga 52 8 0 
aggtcgaggc cctgaggcgg cagttactcc 534 0 
tgaggaatga gcatttccag aaggcgatag 54 00 
aaatagaaat tgagaggctg cagtctctca 54 60 
tagaagaaga actgcggaac ctgaggctgg 552 0 
aagcggacag tgataaaaat gcaaccatct 5580 
acaaccggac cctggaactg caggggctga 5640 
tgagacagga aattgagaaa ttccaaaagc 5700 
aatcaaagaa tcagtgtact caggtggtac 57 60 
aagtcctgga gcaagacaag gcaaggctgc 5820 
aa.tcaactct agaggcagaa accagggtga 58 8 0 
ttcagaatga cctgaatcag tggaagactc 5940 
agatagaatc ggaaagagaa aagagtgaga 6000 
aaagactcca agcagagatc aagagaattg 6060 
ctaccaggga gacacagtca cagttagaaa 6120 
ataaactcag acagcgccca tatgggtccc 6180 
ccgttgacac ctccaagctg gtgtttgatg 6240 
tctatgagtg tcagctgatc gacaaaacaa 6300 
cagtggaaga agttgcttct gaaatccagc 6360 
gagcatctgc ttctcctaag gaaaaatact 6420 
tcagcccaga atccacagtc atgcttctgg 6480 
atccccatcg gaatgagaag ctgactgtcg 6540 
tcgatgaccg tcagcagata tatgcagcag 6600 
tttcaggcaa gacagtatct gtttcagaag 6660 
ccggaatgcg cctgctggaa gcccagattg 6720 
gtgtcttttt gccaaaagat gtcgccttgg 67 80 
gatccctgaa tgatccccga gatagtcaga 6840 
aggtcagtta cgtgcagctg aaggaacggt 6900 
tgctttcagt acagaagaga agcatgtcct 6960 
ctgagctagt agattctggt atattgagac 7020 
agatttctta tgacgaggtt ggtgagagaa 7080 
tagcaggcat atacaatgag accacaaaac 7140 
ttggcttagt ccgacctggt actgctctgg 7200 
ttatagtgga tcctgttagc aacttgaggt 7260 
tggtgggcat tgagttcaaa gagaagctcc 732 0 
atgatcctga aacaggaaac atcatctctt 7380 
aaaagggcca cggtattcgc ttattagaag 7 44 0 
caaaggagag ccatcgttta ccagttgaca 7500 
aactcagtga gattctctca gatccaagtg 7560 
ctgaagaaaa tcttacctat ctgcaactaa 7620 
ggctctgtct tctgcctctg aaagaaaaga 7 680 
ccctcaggaa gcgtagagtg gtcatagttg 77 40 
aggaggccta caagaagggc ctaattgatt 7 8 00 
aatgtgaatg ggaagaaata accatcacgg 78 60 
tagatagaaa gacaggcagt cagtatgata 7 920 
acaggaagtt ctttgatcag taccgatccg 7980 
tgatctcctt gaaaaatggt gtcggcacca 8040 
atgtttttag cagctcccga catgaatcag 8100 
ggaatttaac cataaggagc agctcttttt 8160 
cagccatctt tgacacagaa aacctggaga 8220 
gcatcgttga cagcatcacg ggtcagaggc 8280 



WO 02/47534 



55 



PCT7US01/47576 



ttctggaggc tcaggcctgc acaggtggca tcatccaccc aaccacgggc cagaagctgt 8340 
cacttcagga cgcagtctcc cagggtgtga ttgaccaaga catggccacc agcgtgaagc 8400 
ctgctcagaa agccttcata ggcttcgagg gtgtgaaggg aaagaagaag atgtcagcag 8 4 60 
cagaggcagt gaaagaaaaa tggctcccgt atgaggctgg ccagcgcttc ctggagttcc 8520 
agtacctcac gggaggtctt gttgacccgg aagtgcatgg gaggataagc accgaagaag 8580 
ccatccggaa ggggttcata gatggccgcg ccgc'acagag gctgcaagac accagcagct 8 640 
atgccaaaat cctgacctgc cccaaaacca aattaaaaat atcctataag gatgccataa 8700 
atcgctccat ggtagaagat atcactgggc tgcgccttct ggaagccgcc tccgtgtcgt 87 60 
ccaagggctt acccagccct tacaacatgt cttcggctcc ggggtcccgc tccggctccc 8820 
gctcgggatc tcgctccgga tctcgctccg ggtcccgcag tgggtcccgg agaggaagct 8880 
ttgacgccac agggaattct tcctactctt attcctactc atttagcagt agttctattg 8940 
ggcactag 8948 



<210> 120 
<211> 587 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 91, 131, 256, 263, 332, 392, 400, 403, 461, 496, 497, 499, 
510, 511, 518, 519, 539, 554, 560, 576 
<223> n = A,T,C or G 



<400> 120 

cgtcctaagc acttagacta catcagggaa gaacacagac cacatccctg tcctcatgcg 60 
gcttatgttt tctggaagaa agtggagacc nagtccttgg ctttagggct ccccggctgg 12 0 
gggctgtgca ntccggtcag ggcgggaagg gaaatgcacc gctgcatgtg aacttacagc 180 
ccaggcggat gccccttccc ttagcactac ctggcctcct gcatcccctc gcctcatgtt 24 0 
cctcccacct tcaaanaatg aanaacccca tgggcccagc cccttgccct ggggaaccaa 300 
ggcagccttc caaaactcag gggctgaagc anactattag ggcaggggct gactttgggt 360 
gacactgccc attccctctc agggcagctc angtcacccn ggnctcttga acccagcctg 420 
ttcctttgaa aaagggcaaa actgaaaagg gcttttccta naaaaagaaa aaccagggaa 48 0 
ctttgccagg gcttcnntnt taccaaaacn ncttctcnng gatttttaat tccccattng 540 
gcctccactt accnggggcn atgccccaaa attaanaatt tcccatc 587 

<210> 121 
<211> 619 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 260, 527, 560, 564, 566, 585, 599 
<223> n = A,T,C or G 



<400> 121 

cactagtagg atagaaacac tgtgtcccga 
gcctaaccca ggttaactgc aagaagaggc 
tgcataaagc caatgtagtc cagtttctaa 
tcaatacaca ctcatgaact cctgatggaa 
tgcacacttg ctagactcan aaaaaatact 
gacaacctac tttgcttggc tgagtgaagg 
gacatttagt tagtgctttt tatataccag 
tttccaaatt tttgtacagt cgctgcacat 
aatgaagtcc ctggtttttc atggcaactt 



gagtaaggag agaagctact attgattaga 60 
gggatacttt cagctttcca tgtaactgta 12 0 
gatcatgttc caagctaact gaatcccact 18 0 
caataacagg cccaagcctg tggtatgatg 24 0 
actctcataa atgggtggga gtattttggt 300 
aatgatattc atatattcat ttattccatg 360 
gcatgatgct gagtgacact cttgtgtata 42 0 
atttgaaatc atatattaag acttccaaaa 480 
gatcagtaaa ggattcncct ctgtttggta 54 0 
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cttaaaacat ctactatatn gttnanatga 
aagtggtggg gaaaaaaaa 

<210> 122 
<211> 1475 
<212> DNA 
<213> Homo sapiens 

<400> 122 

tccacctgtc cccgcagcgc cggctcgcgc 
agcgccccga cctcgccacc atgagagccc 
tcgtgagcga ctccaaaggc agcaatgaac 
taaatggagg aacatgtgtg tccaacaagt 
caaagaaatt cggagggcag cactgtgaaa 
atggtcactt ttaccgagga aaggccagca 
ggaactctgc cactgtcctt cagcaaacgt 
tgggcctggg gaaacataat tactgcagga 
atgtgcaggt gggcctaaag ccgcttgtcc 
gaaaaaagcc ctcctctcct ccagaagaat 
ggccccgctt taagattatt gggggagaat 
cggccatcta caggaggcac cgggggggct 
tcagcccttg ctgggtgatc agcgccacac 
actacatcgt ctacctgggt cgctcaaggc 
ttgaggtgga aaacctcatc ctacacaagg 
acgacattgc cttgctgaag atccgttcca 
ctatacagac catctgcctg ccctcgatgt 
agatcactgg ctttggaaaa gagaattcta 
tgactgttgt gaagctgatt tcccaccggg 
aagtcaccac caaaatgctg tgtgctgctg 
gagactcagg gggacccctc gtctgttccc 
tgagctgggg ccgtggatgt gccctgaagg 
acttcttacc ctggatccgc agtcacacca 
ccccagggag gaaacgggca ccacccgctt 
catctccatc agctgtaaga agagactggg 

<210> 123 
<211> 2294 
<212> DNA 
<213> Homo sapiens 

<400> 123 

cagcgccggc tcgcgccctc ctgccgcagc 
gccaccatga gagccctgct ggcgcgcctg 
aaaggcagca atgaacttca tcaagttcca 
tgtgtgtcca acaagtactt ctccaacatt 
gggcagcact gtgaaataga taagtcaaaa 
cgaggaaagg ccagcactga caccatgggc 
gtccttcagc aaacgtacca tgcccacaga 
cataattact gcaggaaccc agacaaccgg 
ctaaagccgc ttgtccaaga gtgcatggtg 
tctcctccag aagaattaaa atttcagtgt 
attattgggg gagaattcac caccatcgag 
aggcaccggg ggggctctgt cacctacgtg 
gtgatcagcg ccacacactg cttcattgat 
ctgggtcgct caaggcttaa ctccaacacg 
ctaatcctac acaaggacta cagcgctgac 
ctgaagatcc gttccaagga gggcaggtgt 
tgcctgccct cgatgtataa cgatccccag 



aattcctttt ccccncctcc cgaaaaaana 600 
619 



cctcctgccg cagccaccga gccgccgtct 60 
tgctggcgcg cctgcttctc tgcgtcctgg 120 
ttcatcaagt tccatcgaac tgtgactgtc 180 
acttctccaa cattcactgg tgcaactgcc 240 
tagataagtc aaaaacctgc tatgagggga 300 
ctgacaccat gggccggccc tgcctgccct 3 60 
accatgccca cagatctgat gctcttcagc 42 0 
acccagacaa ccggaggcga ccctggtgct 480 
aagagtgcat ggtgcatgac tgcgcagatg 540 
taaaatttca gtgtggccaa aagactctga 600 
tcaccaccat cgagaaccag ccctggtttg 660 
ctgtcaccta cgtgtgtgga ggcagcctca 720 
actgcttcat tgattaccca aagaaggagg 78 0 
ttaactccaa cacgcaaggg gagatgaagt 840 
actacagcgc tgacacgctt gctcaccaca 900 
aggagggcag gtgtgcgcag ccatcccgga 960 
ataacgatcc ccagtttggc acaagctgtg 102 0 
ccgactatct ctatccggag cagctgaaga 1080 
agtgtcagca gccccactac tacggctctg 1140 
acccacagtg gaaaacagat tcctgccagg 1200 
tccaaggccg catgactttg actggaattg 12 60 
acaagccagg cgtctacacg agagtctcac 1320 
aggaagagaa tggcctggcc ctctgagggt 1380 
tcttgctggt tgtcattttt gcagtagagt 1440 
aagat 1475 



caccgagccg ccgtctagcg ccccgacctc 60 
cttctctgcg tcctggtcgt gagcgactcc 120 
tcgaactgtg actgtctaaa tggaggaaca 180 
cactggtgca actgcccaaa gaaattcgga 24 0 
acctgctatg aggggaatgg tcacttttac 300 
cggccctgcc tgccctggaa ctctgccact 360 
tctgatgctc ttcagctggg cctggggaaa 420 
aggcgaccct ggtgctatgt gcaggtgggc 480 
catgactgcg cagatggaaa aaagccctcc 540 
ggccaaaaga ctctgaggcc ccgctttaag 60 0 
aaccagccct ggtttgcggc catctacagg 660 
tgtggaggca gcctcatcag cccttgctgg 720 
tacccaaaga aggaggacta catcgtctac 780 
caaggggaga tgaagtttga ggtggaaaac 84 0 
acgcttgctc accacaacga cattgccttg 900 
gcgcagccat cccggactat acagaccatc 960 
tttggcacaa gctgtgagat cactggcttt 1020 
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ggaaaagaga attctaccga ctatctctat ccggagcagc tgaaaatgac tgttgtgaag 1080 
ctgatttccc accgggagtg tcagcagccc cactactacg gctctgaagt caccaccaaa 1140 
atgctgtgtg ctgctgaccc acagtggaaa acagattcct gccagggaga ctcaggggga 12 00 
cccctcgtct gttccctcca aggccgcatg actttgactg gaattgtgag ctggggccgt 12 60 
ggatgtgccc tgaaggacaa gccaggcgtc tacacgagag tctcacactt cttaccctgg 1320 
atccgcagtc acaccaagga agagaatggc ctggccctct gagggtcccc agggaggaaa 1380 
cgggcaccac ccgctttctt gctggttgct attttgcagt agagtcatct ccatcagctg 1440 
taagaagagc tgggaatata ggctctgcac agatggattt gcctgtgcca ccaccagggc 1500 
gaacgacaat agctttaccc tcaggcatag gcctgggtgc tggctgccca gacccctctg 15 60 
gccaggatgg aggggtggtc ctgactcaac atgttactga ccagcaactt gtctttttct 1620 
ggactgaagc ctgcaggagt taaaaagggc agggcatctc ctgtgcatgg gctcgaaggg 1680 
agagccagct cccccgaccg gtgggcattt gtgaggccca tggttgagaa atgaataatt 17 4 0 
tcccaattag gaagtgtaag cagctgaggt ctcttgaggg agcttagcca atgtgggagc 18 00 
agcggtttgg ggagcagaga cactaacgac ttcagggcag ggctctgata ttccatgaat 1860 
gtatcaggaa atatatatgt gtgtgtatgt ttgcacactt gtgtgtgggc tgtgagtgta 1920 
agtgtgagta agagctggtg tctgattgtt aagtctaaat atttccttaa actgtgtgga 1980 
ctgtgatgcc acacagagtg gtctttctgg agaggttata ggtcactcct ggggcctctt 2040 
gggtccccca cgtgacagtg cctgggaatg tattattctg cagcatgacc tgtgaccagc 2100 
actgtctcag tttcactttc acatagatgt ccctttcttg gccagttatc ccttcctttt 2160 
agcctagttc atccaatcct cactgggtgg ggtgaggacc actcctgtac actgaatatt 2220 
tatatttcac tatttttatt tatatttttg taattttaaa taaaagtgat caataaaatg 2280 
tgatttttct gatg 2294 

<210> 124 
<211> 956 
<212> DNA 
<213> Homo sapiens 



<400> 124 

gatgagttcc gcaccaagtt tgagacagac 
atcaatggcc tgcgcagggt gctggatgag 
cagattgaga acctcaagga ggagctggcc 
aacgccctgc gaggccaggt gggtggtgag 
gtggacctga gccgcatcct caacgagatg 
aaccgcaagg atgccgagga ttggttcttc 
gccaccaaca gtgagctggt gcagagtggc 
atgcaggcct tggagataga gctgcagtcc 
aacctggcgg agacagagaa ccgctactgc 
ggcagcgtgg aggagcagct ggcccagctt 
tacaaaatcc tgctggatgt gaagacgcgg 
ctgctggagg gagaggatgc ccacctgact 
caggtgcgta ccattgtgga agaggtccag 
gtccaccaga ccacccgctg aggactcagc 
cgcagccgcc ccatctgccc cacagtctcc 
tcccttcccc atgcttcctt gcctgatgac 

<210> 125 
<211> 486 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 16 

<223> n = A,T,C or G 



caggccctgc gcctgagtgt ggaggccgac 60 
ctgaccctgg ccagagccga cctggagatg 12 0 
tacctgaaga agaaccacga ggaggagatg 180 
atcaatgtgg agatggacgc tgccccaggc 240 
cgtgaccagt atgagaagat ggcagagaag 300 
agcaagacag aggaactgaa ccgcgaggtg 3 60 
aagagtgaga tctcggagct ccggcgcacc 420 
cagctcagca tgaaagcatc cctggagggc 480 
gtgcagctgt cccagatcca ggggctgatt 54 0 
cgctgcgaga tggagcagca gaaccaggaa 600 
ctggagcagg agattgccac ctaccgccgc 660 
cagtacaaga aagaaccggt gaccacccgt 72 0 
gatggcaagg tcatctcctc ccgcgagcag 780 
taccccggcc ggccacccag gaggcaggga 840 
ggcctctcca gcctcagccc cctgcttcag 900 
aataaaagct tgttgactca gctatg 95 6 



<400> 125 

aaattatata tagtgnttca gctcccattg tggtgttcat agtcttctag gaacagataa 60 
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acttaagtat tcaattcact cttggcattt tttctttaat ataggctttt tagcctattt 120 

ttggaaaact gcttttcttc tgagaacctt attctgaatg tcatcaactt taccaaacct 180 

tctaagtcca gagctaactt agtactgttt aagttactat tgactgaatt ttcttcattt 24 0 

tctgtttagc cagtgttacc aaggtaagct ggggaatgaa gtataccaac ttctttcaga 300 

gcattttagg acattatggc agctttagaa ggctgtcttg tttctagcca agggagagcc 360 

agcgcaggtt ttggatacta gagaaagtca tttgcttgta ctattgccat tttagaaagc 42 0 

tctgatgtga attcaaattt tacctctgtt acttaaagcc aacaatttta aggcagtagt 480 

tttact 486 

<210> 126 
<211> 3552 
<212> DNA 
<213> Homo sapiens 

<400> 126 

cggcaggcag gtctcgtctc ggcaccctcc cggcgcccgc gttctcctgg ccctgcccgg 60 
catcccgatg gccgccgctg ggccccggcg ctccgtgcgc ggagccgtct gcctgcatct 120 
gctgctgacc ctcgtgatct tcagtcgtgc tggtgaagcc tgcaaaaagg tgatacttaa 180 
tgtaccttct aaactagagg cagacaaaat aattggcaga gttaatttgg aagagtgctt 240 
caggtctgca gacctcatcc ggtcaagtga tcctgatttc agagttctaa atgatgggtc 300 
agtgtacaca gccagggctg ttgcgctgtc tgataagaaa agatcattta ccatatggct 360 
ttctgacaaa aggaaacaga cacagaaaga ggttactgtg ctgctagaac atcagaagaa 42 0 
ggtatcgaag acaagacaca ctagagaaac tgttctcagg cgtgccaaga ggagatgggc 480 
acctattcct tgctctatgc aagagaattc cttgggccct ttcccattgt ttcttcaaca 540 
agttgaatct gatgcagcac agaactatac tgtcttctac tcaataagtg gacgtggagt 600 
tgataaagaa cctttaaatt tgttttatat agaaagagac actggaaatc tattttgcac 660 
tcggcctgtg gatcgtgaag aatatgatgt ttttgatttg attgcttatg cgtcaactgc 720 
agatggatat tcagcagatc tgcccctccc actacccatc agggtagagg atgaaaatga 780 
caaccaccct gttttcacag aagcaattta taattttgaa gttttggaaa gtagtagacc 84 0 
tggtactaca gtgggggtgg tttgtgccac agacagagat gaaccggaca caatgcatac 900 
gcgcctgaaa tacagcattt tgcagcagac accaaggtca cctgggctct tttctgtgca 960 
tcccagcaca ggcgtaatca ccacagtctc tcattatttg gacagagagg ttgtagacaa 1020 
gtactcattg ataatgaaag tacaagacat ggatggccag ttttttggat tgataggcac 108 0 
atcaacttgt atcataacag taacagattc aaatgataat gcacccactt tcagacaaaa 1140 
tgcttatgaa gcatttgtag aggaaaatgc attcaatgtg gaaatcttac gaatacctat 1200 
agaagataag gatttaatta acactgccaa ttggagagtc aattttacca ttttaaaggg 12 60 
aaatgaaaat ggacatttca aaatcagcac agacaaagaa actaatgaag gtgttctttc 1320 
tgttgtaaag ccactgaatt atgaagaaaa ccgtcaagtg aacctggaaa ttggagtaaa 1380 
caatgaagcg ccatttgcta gagatattcc cagagtgaca gccttgaaca gagccttggt 1440 
tacagttcat gtgagggatc tggatgaggg gcctgaatgc actcctgcag cccaatatgt 1500 
gcggattaaa gaaaacttag cagtggggtc aaagatcaac ggctataagg catatgaccc 15 60 
cgaaaataga aatggcaatg gtttaaggta caaaaaattg catgatccta aaggttggat 1620 
caccattgat gaaatttcag ggtcaatcat aacttccaaa atcctggata gggaggttga 1680 
aactcccaaa aatgagttgt ataatattac agtcctggca atagacaaag atgatagatc 1740 
atgtactgga acacttgctg tgaacattga agatgtaaat gataatccac cagaaatact 1800 
tcaagaatat gtagtcattt gcaaaccaaa aatggggtat accgacattt tagctgttga 1860 
tcctgatgaa cctgtccatg gagctccatt ttatttcagt ttgcccaata cttctccaga 1920 
aatcagtaga ctgtggagcc tcaccaaagt taatgataca gctgcccgtc tttcatatca 1980 
gaaaaatgct ggatttcaag aatataccat tcctattact gtaaaagaca gggccggcca 2040 
agctgcaaca aaattattga gagttaatct gtgtgaatgt actcatccaa ctcagtgtcg 2100 
tgcgacttca aggagtacag gagtaatact tggaaaatgg gcaatccttg caatattact 2160 
gggtatagca ctgctctttt ctgtattgct aactttagta tgtggagttt ttggtgcaac 2220 
taaagggaaa cgttttcctg aagatttagc acagcaaaac ttaattatat caaacacaga 2280 
agcacctgga gacgatagag tgtgctctgc caatggattt atgacccaaa ctaccaacaa 2340 
ctctagccaa ggtttttgtg gtactatggg atcaggaatg aaaaatggag ggcaggaaac 2 4 00 
cattgaaatg atgaaaggag gaaaccagac cttggaatcc tgccgggggg ctgggcatca 2460 
tcataccctg gactcctgca ggggaggaca cacggaggtg gacaactgca gatacactta 2520 
ctcggagtgg cacagtttta ctcaaccccg tctcggtgaa aaattgcatc gatgtaatca 2580 
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gaatgaagac cgcatgccat cccaagatta 
atctccagct ggttctgtgg gctgctgcag 
tttaaataat ttggaaccca aatttattac 
tcacagtgct acaattaggt ctttgtcaga 
aaagttcaat ttcaacatgt atgtatatga 
ctcaccaatt tatattttta aagcaagttg 
ttaaaacaga caactggtaa atctcaaact 
ctgctctttt ttttttttac agatatttta 
aacaatagct aagttatgct aatatcacat 
aaaaataaac aagaaatatt gagtatcact 
gaagactgaa ttaaattaaa aatgttgcag 
actaccaaat tcatttgact ttggaggcaa 
ttttctatag gaatatagtt ggaaataaat 
aatatttaaa tgaaatgaga acaaagagga 
tatagtttgt cctacaatag aaaaaagaga 
gcattataac tgagtctatg aggaaatagt 
ttgtaaataa at 



tgtcctcact tataactatg agggaagagg 2 64 0 
tgaaaagcag gaagaagatg gccttgactt 27 00 
attagcagaa gcatgcacaa agagataatg 27 60 
cattctggag gtttccaaaa ataatattgt 2820 
tgattttttt ctcaattttg aattatgcta 2880 
ttgcttatct tttccaaaaa gtgaaaaatg 2940 
ccagcactgg aattaaggtc tctaaagcat 3000 
gtaataaata tgctggataa atattagtcc 30 60 
tattatgtat tcactttaag tgatagttta 3120 
atgtgaagaa agttttggaa aagaaacaat 3180 
ctcataaaga attggactca cccctactgc 3240 
aatgtgttga agtgccctat gaagtagcaa 3300 
gtgtgtgtgt atattattat taatcaatgc 33 60 
aaatggtaaa aacttgaaat gaggctgggg 3420 
gagcttccta ggcctgggct cttaaatgct 34 80 
tcctgtccaa tttgtgtaat ttgtttaaaa 3540 
3552 



<210> 127 
<211> 754 
<212> DNA 
<213> Homo sapiens 

<400> 127 

tttttttttt ttgtcattgt tcattgattt taatgagaaa gctaagagag gaaataagta 60 
gcctttcaaa ggtcacacag aagtaagtga cagatccagg attcatatcc aagcattctg 12 0 
gctctagtgt ccatgcttct caaccattat gacccaatat tcaaccaaat caatactgaa 180 
ggacacgtga aatgtatccg gtattttact attacaaaca aaaatccaat gaacattctt 240 
gaagacatac acaaaaataa tggttacaat agaagttact ggaattgaaa ttttggttca 300 
acctatatta aaatgtaagg cttttgatat agctaataga tttttgaaat gatcagtctt 360 
aacgtttgta ggggagcaca ctcctgcatg gggaaaagat tcactgtgaa gcacagagca 420 
cctttatggt tggatcatct tgtcattaaa gttcaggcgt tatctatcct gtaagtggca 480 
gaatcaagac tgcaatatcg cctgcttttc tttttaactc atgttttccc ttgactacac 54 0 
tggtcctcaa agtaaaaccc ctgtgtcagt gtactattca tggaatactc tgcaattata 600 
accaccttct aatactttta atacccaatc aaaatttatt atacatatgt atcatagata 660 
ctcatctgta aagctgtgct tcaaaatagt gatctcttcc caacattaca atatatatta 720 
atgatgtcga acctgcccgg gcggccgctc gaag 754 

<210> 128 
<211> 374 
<212> DNA 
<213> Homo sapiens 

<400> 128 

aggttttgat taaaaaggca aatgatttta ttgttcgata atcttttaaa aaaataagag 60 
gaaggagtaa aattaaagat gaaagatgat ttttatttcc ttgtgacctc tatatccccc 12 0 
ttcccctgcc cttggtaagt aactcttgat ggagaaagga ttaaagactc ttatttaacc 180 
aaaaaacaga gccagctaat catttccaaa ggttagtatc tccctgctga cctcttcttt 240 
ggtttaattg aataaaacta tatgttcata tatgtattaa aacaactcag aataacatct 300 
tttcttcctt agttaaggca ttataagggc tatactatca tccataataa ccaaggcaat 360 
aacttaaaaa gctg 37 4 

<210> 129 
<211> 546 
<212> DNA 
<213> Homo sapiens 



<400> 129 
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agtgtgatgg atatctgcag aattcgggct 
tcccagcacy tgaaaaggag cctcctgagc 
cctcatttct gcctactgat ttccttggag 
aacctggtac atacatagca tgactccctg 
gagagtgatt gacatgcact ttcaagctat 
acctcgagta aattccatca ttttttataa 
tcagcgtaac aggatctcca gtctctggct 
tgggataaaa tccctgtttc acattggcat 
tgtctctttc cacaaaggct tccacagtgg 



aagcgtggtc gcggcccgag gtctggaact 60 
tgactcggct aaagccccac tttcgctcct 120 
cattcatctg aatattaccg tttgctgtgt 18 0 
gaatagagtg ggctggggtg cttatgctgg 24 0 
atctaccatt tgcagcaaag gagaaaaaat 300 
catcagcacc tgctccatca tcaaggagtc 360 
caactgtggc agtgacagtg gcattaagaa 42 0 
aaatcatcac aggatgagga aaatggaggc 480 
ctgggggcac agacctgccc gggcggccgc 54 0 
54 6 



<210> 130 
<211> 5156 
<212> DNA 

<213> Homo sapiens 



<400> 130 

accaaccgag gcgccgggca gcgacccctg 
ccgccatgcc tgcgctctgg ctgggctgct 
cccgggccac ctccaggagg gaagtctgtg 
ttgatcggga acttcacaga caaactggta 
acactgatgg cattcactgc gagaagtgca 
accgctgttt gccctgcaat tgtaactcca 
ccggacggtg cagctgtaaa ccaggtgtga 
gcttccacat gctcacggat gcggggtgca 
gtgactgtga cccagctggc atcgcagggc 
cagctgtcac tggagaacgc tgtgataggt 
ggaaccctga gggctgtacc cagtgtttct 
ctgcagaata cagtgtccat aagatcacct 
aggctgtcca acgaaatggg tctcctgcaa 
tgtttagctc agcccaacga ctagaccctg 
ggaatcaaca ggtgagctat ggtcaaagcc 
gcagacaccc atctgcccat gatgtgattc 
ccttgatgcc acttggcaag acactgcctt 
taaatgagca tccaagcaat aattggagcc 
tactgcggaa tctcacagcc ctccgcatcc 
acattgacaa tgtgaccctg atttcagccc 
ttgaacagtg tatatgtcct gttgggtaca 
gctacaagag agattcagcg agactggggc 
aagggggagg ggcctgtgat ccagacacag 
acattgagtg tgctgactgc ccaattggtt 
gcaagccatg tccctgtcat aacgggttca 
tggtgtgcaa taactgccct cccggggtca 
gctactttgg ggaccccttt ggtgaacatg 
gcaacaacaa tgtggacccc agtgcctctg 
tgaagtgtat ccacaacaca gccggcatct 
gggacccatt ggctcccaac ccagcagaca 
gctcagagcc tgtaggatgt cgaagtgatg 
gccccaactg tgagcatgga gcattcagct 
agatggatca gtttatgcag cagcttcaga 
gtggtgatgg agtagtacct gatacagagc 
cccttcagga cattctgaga gatgcccaga 
tccagttggc caaggtgagg agccaagaga 
agatgactgt ggaaagagtt cgggctctgg 
ctcacaggct catcactcag atgcagctga 
acactaacat tcctgcctca gaccactacg 
aggaggccac aagattagca gaaagccacg 
caagggaaac tgaggactat tccaaacaag 



cagcggagac agagactgag cggcccggca 60 
gcctctgctt gtcgctcctc ctgcccgcag 12 0 
attgcaatgg gaagtccagg cagtgtatct 180 
atggattccg ctgcctcaac tgcaatgaca 24 0 
agaatggctt ttaccggcac agagaaaggg 300 
aaggttctct tagtgctcga tgtgacaact 360 
caggagccag atgcgaccga tgtctgccag 420 
cccaagacca gagactgcta gactccaagt 480 
cctgtgacgc gggccgctgt gtctgcaagc 54 0 
gtcgatcagg ttactataat ctggatgggg 600 
gctatgggca ttcagccagc tgccgcagct 660 
ctacctttca tcaagatgtt gatggctgga 720 
agctccaatg gtcacagcgc catcaagatg 780 
tctattttgt ggctcctgcc aaatttcttg 840 
tgtcctttga ctaccgtgtg gacagaggag 900 
tggaaggtgc tggtctacgg atcacagctc 960 
gtgggctcac caagacttac acattcaggt 1020 
cccagctgag ttactttgag tatcgaaggt 1080 
gagctacata tggagaatac agtactgggt 114 0 
gccctgtctc tggagcccca gcaccctggg 12 00 
aggggcaatt ctgccaggat tgtgcttctg 12 60 
cttttggcac ctgtattcct tgtaactgtc 1320 
gagattgtta ttcaggggat gagaatcctg 1380 
tctacaacga tccgcacgac ccccgcagct 1440 
gctgctcagt gatgccggag acggaggagg 1500 
ccggtgcccg ctgtgagctc tgtgctgatg 1560 
gcccagtgag gccttgtcag ccctgtcaat 1620 
ggaattgtga ccggctgaca ggcaggtgtt 1680 
actgcgacca gtgcaaagca ggctacttcg 17 4 0 
agtgtcgagc ttgcaactgt aaccccatgg 1800 
gcacctgtgt ttgcaagcca ggatttggtg 18 60 
gtccagcttg ctataatcaa gtgaagattc 1920 
gaatggaggc cctgatttca aaggctcagg 1980 
tggaaggcag gatgcagcag gctgagcagg 2040 
tttcagaagg tgctagcaga tcccttggtc 2100 
acagctacca gagccgcctg gatgacctca 2160 
gaagtcagta ccagaaccga gttcgggata 222 0 
gcctggcaga aagtgaagct tccttgggaa 2280 
tggggccaaa tggctttaaa agtctggctc 2340 
ttgagtcagc cagtaacatg gagcaactga 2 4 00 
ccctctcact ggtgcgcaag gccctgcatg 24 60 
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aaggagtcgg aagcggaagc ggtagcccgg 
aattggagaa aaccaagtcc ctggcccagc 
ttgaagcaga taggtcttat cagcacagtc 
agggagtcag tgatcagtcc tttcaggtgg 
attcactctc aagcctggta accaggcata 
tgggaaactg gaaagaagaa gcacagcagc 
aatcagatca gctgctttcc cgtgccaatc 
gtatgggcaa tgccactttt tatgaagttg 
acctgcaggt ggacaacaga aaagcagaag 
tcagccagaa ggtttcagat gccagtgaca 
gcgctgctgc tgatgcacag agggcaaaga 
gtgagattga acaggagatt gggagtctga 
ccttggccat ggaaaaggga ctggcctctc 
agctggaaag gaaggagctg gagtttgaca 
cagaagccca gaaggttgat accagagcca 
tcaacacatt agacggcctc ctgcatctga 
ggctggtctt actggagcag aagctttccc 
ggcccatgat gtcagagctg gaagagaggg 
tggagacaag catagatggg attctggctg 
acctgccccc aggctgctac aatacccagg 
atttctcaac tgaggttctt gggatacaga 
tgggtgggat ggggacattt gaacatgttt 
cccattcctg atcccatggc caggtggttg 
tgctgggcaa tgaggcagat agcactgggt 
gaatagactg gatggaaaga caaactgcac 
gtggagtcct ggaatttgga caagtgctgt 
tgtgactaaa ggaaaaaact ttgactttgc 
cagagtgcaa cccagtcaca ctgtggccag 
caagcttctt gctgatcaga gttcctccta 
attttcaagc tggaagaagt gagcagtgtt 
agagctatgg tgcttgctgg tgcctgccac 
tttcttttaa tgatgccatg gcaacttaga 
agcaaagcaa atgttgggaa agtatttact 
gcttgggcat tgaaagaggt aaaattctct 
ttagaacacc aaaaatgatg cgcatcaatg 
tctttcctcc acccataata agagaatgtt 
tccctccatt catccttcca tccatctttc 
tatatttatt gagtacctac tgtgtgccag 
tgccctcata gagttgattg tctagtgagg 
acttacaaac tttgtttgtc acaagtggtg 
tctttgctca acagaacata tgttgcaaga 
aggctgacag agctctgggt tgtgcacatt 
ttctacaact gattgcaaca gactgttgag 
gaaccagagg cacttccacc ttggctggga 
ccttggattt tcctgaaagt gtttttaaat 



acggtgctgt ggtgcaaggg cttgtggaaa 2520 
agttgacaag ggaggccact caagcggaaa 2580 
tccgcctcct ggattcagtg tctcggcttc 2 640 
aagaagcaaa gaggatcaaa caaaaagcgg 2700 
tggatgagtt caagcgtaca cagaagaatc 27 60 
tcttacagaa tggaaaaagt gggagagaga 2820 
ttgctaaaag cagagcacaa gaagcactga 2 8 80 
agagcatcct taaaaacctc agagagtttg 2940 
ctgaagaagc catgaagaga ctctcctaca 3000 
agacccagca agcagaaaga gccctgggga 3060 
atggggccgg ggaggccctg gaaatctcca 3120 
acttggaagc caatgtgaca gcagatggag 3180 
tgaagagtga gatgagggaa gtggaaggag 3240 
cgaatatgga tgcagtacag atggtgatta 3300 
agaacgctgg ggttacaatc caagacacac 3360 
tggaccagcc tctcagtgta gatgaagagg 3420 
gagccaagac ccagatcaac agccaactgc 34 8 0 
cacgtcagca gaggggccac ctccatttgc 3540 
atgtgaagaa cttggagaac attagggaca 3600 
ctcttgagca acagtgaagc tgccataaat 3660 
tctcagggct cgggagccat gtcatgtgag 3720 
aatgggtatg ctcaggtcaa ctgacctgac 3780 
tcttattgca ccatactcct tgcttcctga 3840 
gtgagaatga tcaaggatct ggaccccaaa 3900 
aggcagatgt ttgcctcata atagtcgtaa 3960 
tgggatatag tcaacttatt ctttgagtaa 4020 
ccaggcatga aattcttcct aatgtcagaa 4080 
taaaatacta ttgcctcata ttgtcctctg 4140 
cttacaaccc agggtgtgaa catgttctcc 4200 
ggagtgagga cctgtaaggc aggcccattc 42 60 
cttcaagttc tggacctggg catgacatcc 4320 
gattgcattt ttattaaagc atttcctacc 4380 
ttttcggttt caaagtgata gaaaagtgtg 44 40 
agatttatta gtcctaattc aatcctactt 4500 
tattttatct tattttctca atctcctctc 4560 
cctactcaca cttcagctgg gtcacatcca 4620 
catccattac ctccatccat ccttccaaca 4 680 
gggctggtgg gacagtggtg acatagtctc 4740 
aagacaagca tttttaaaaa ataaatttaa 4800 
tttattgcaa taaccgcttg gtttgcaacc 48 60 
ccctcccatg ggggcacttg agttttggca 4 92 0 
tctttgcatt ccagctgtca ctctgtgcct 4980 
ttatgataac accagtggga attgctggag 5040 
agactatggt gctgccttgc ttctgtattt 5100 
aaagaacaat tgttagaaaa aaaaaa 5156 



<210> 131 
<211> 671 
<212> DNA 
<213> Homo sapiens 

<400> 131 

aggtctggag ggcccacagc cggatgtggg 
ttttgcatcc cggttgcagt gtgttgcaga 
cctgggcagc caycacgagg atcatgactc 
tcccgatgct ggtggagtgt ttgttgacac 



acaccgggaa aaagtggtca tagcacacat 60 
cgaagtcctc ttgctcgtca ccccacactt 120 
ggaaaataaa gatgactgtg atccacacct 18 0 
ccccgatgaa agtgtgcagc gtcccccaat 240 
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ccattgcgct ggtttatccc tgagtcctgt ttccaacgac tgccagtgtt tcagacccaa 300 
agaatgaggg caagatccct ctgcgagggt ttcagacctc cttctcctac cccactggag 360 
tgcctagaag ccaatgggtg cacagtgatg atacgaatgt caatctttgc tcggtcagtg 420 
aggatgtcgc ctggaatatt caaattgaat tacagatgca tgaagagggc gtacaagtta 4 80 
gaatttttct ttcgccatac agaaattgtt tagccagatc ttctgtactt cttttccttc 540 
cctgaccctt cctgctcccc aggaagggag gtcagccccg tttgcaaaac acaggatgcc 600 
cgtgacaccg gagacaggtc ttcttcaccg acaggaagtg ccttctggtg cctgcacgtt 660 
ttaactgcta t 671 

<210> 132 
<211> 590 
<212> DNA 
<213> Homo sapiens 

<400> 132 

ctgaatggaa aagcttatgg ctctgtgatg atattagtga ccagcggaga tgataagctt 60 
cttggcaatt gcttacccac tgtgctcagc agtggttcaa caattcactc cattgccctg 12 0 
ggttcatctg cagccccaaa tctggaggaa ttatcacgtc ttacaggagg tttaaagttc 180 
tttgttccag atatatcaaa ctccaatagc atgattgatg ctttcagtag aatttcctct 240 
ggaactggag acattttcca gcaacatatt cagcttgaaa gtacaggtga aaatgtcaaa 300 
cctcaccatc aattgaaaaa cacagtgact gtggataata ctgtgggcaa cgacactatg 360 
tttctagtta cgtggcaggc cagtggtcct cctgagatta tattatttga tcctgatgga 42 0 
cgaaaatact acacaaataa ttttatcacc aatctaactt ttcggacagc tagtctttgg 480 
attccaggaa cagctaagcc tgggcactgg acttacaccc tgaacaatac ccatcattct 540 
ctgcaagccc tgaaagtgac agtgacctct cgcgcctcca actcagacct 5 90 

<210> 133 
<211> 581 
<212> DNA 
<213> Homo sapiens 

<400> 133 

aggtcctgtc cgggggcact gagaactccc tctggaattc ttggggggtg ttggggagag 60 
actgtgggcc tggagataaa acttgtctcc tctaccacca ccctgtaccc tagcctgcac 12 0 
ctgtcctcat ctctgcaaag ttcagcttcc ttccccaggt ctctgtgcac tctgtcttgg 180 
atgctctggg gagctcatgg gtggaggagt ctccaccaga gggaggctca ggggactggt 240 
tgggccaggg atgaatattt gagggataaa aattgtgtaa gagccaaaga attggtagta 300 
gggggagaac agagaggagc tgggctatgg gaaatgattt gaataatgga gctgggaata 360 
tggctggata tctggtacta aaaaagggtc tttaagaacc tacttcctaa tctcttcccc 420 
aatccaaacc atagctgtct gtccagtgct ctcttcctgc ctccagctct gccccaggct 480 
cctcctagac tctgtccctg ggctagggca ggggaggagg gagagcaggg ttgggggaga 54 0 
ggctgaggag agtgtgacat gtggggagag gaccagacct c 581 

<210> 134 
<211> 4797 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 135, 501, 4421, 4467, 4468, 4698 
<223> n = A,T,C or G 

<400> 134 

cctgggacca aagtgctgcc cagagctgag ggtcctggag ccacatgaga aggcttctcc 60 
ctgtgtacct gtgcagcaca gggtagggtg agtccactca gctgtctagg agaggaccca 12 0 
ggagcagcag agacncgcca agcctttact cataccatat tctgatcctt ttccagcaaa 180 
ttgtggctac taatttgccc cctgaagatc aagatggctc tggggatgac tctgacaact 24 0 
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tctccggctc aggtgcaggt gaggttgtca 
ggtcatgcct gggggcagtg gtcaggcagt 
caccctgcct gccctgtctc cacccagctg 
ggggtcgtga gctgctgtta aggagagctc 
tccagaaggc cccagggcgc nctgctgcac 
ctcactaaga aacctctgga acccccttca 
tctcatctgc aaaatgggaa taataccttg 
cacagagcca gctggggtgt agctcttcca 
tgtggggact gggggagaga agtccctgag 
aggtggctct tgagtggacc tcaggaagag 
tcatccctgg ggaagtgacc tagcggaggc 
tggaagtgtc tgttgttgga agtgggggcc 
tgtgtgccct gggataagta ggataaccac 
cacccctgtg gtcacagaaa agctttccca 
gagacaggct gcacctgaca cacacaatgg 
gagcttagcc tcagctgcct tgtccaggta 
gcccaggtgc tctggagcct cccccgaccc 
accccccacc tccccaacac actctgcttc 
cttgtcacag cagaccccct ccacttggaa 
gtctccagaa cccaccggcc tggaggctac 
agaggggccc aaggagggag aggctgtagt 
ccgggagcag gaggccaccc cccgacccag 
ggcctcaacg accacagcca ccacggccca 
catgcagcct ggccaccatg agacctcaac 
cactccccac acagaggatg gaggtccttc 
ctccagtcag ctcccagcag cagagggctc 
ttgggaaatt gagtgggttg gtcctaatgc 
ctgcgcgatc tcgtattcct caccaggaag 
ccagggcctc gcagagcagg acagactaac 
atcacccaag agagggctcc caaactcaca 
aacgttatac cagtcatttt atttatagct 
gctattcata caaaatgtgt gctttgtatc 
ccagggtccg gagttgatgt ggcaagaagg 
ttgggtgcat ctgagtgggt ggtggcaaag 
gtagtggagc tggttgctgc tgctggcggt 
cccacaggac ttcacctttg aaacctcggg 
tgaccgccgg aaccagtccc cagtggatca 
ggacaggaaa gaggtgctgg gaggtgagtt 
gctgctgtgg ggtcagggtg gggctgacca 
cacgagagcc caaggagccg ctgagctgag 
tgccggaggc ctcgtggggc tcatctttgc 
catgaagaag aaggacgaag gcagctactc 
ggcctaccag aagcccacca aacaggagga 
ccctccgccc tgccactcac taggccccca 
tggcctcccc tgccaccagg ccacctcccc 
cacggagtcg tgggtgtgct gggagctcca 
tagggcacca ggggtttctc gcataggacc 
ccattctgac tcggtttctc caaactgaag 
gagggggatc cgactgcttt ggacctaaat 
ggggcttggg gctcacacac ctgtagcact 
ggccgctgag tggcagggga caggagtcac 
tcgacttgtt tttgcacatg tttcctctag 
acttctgagg taagttaagt aagttgattc 
tggtcgggag acagcatcag ggttaagaag 
ccaaatctgg aagccaaaat gtaggcttag 
atgtgtgcaa cagggtatgg actatctgtc 
ggctggccag tccaggctgc cgtggggccg 
catgcgctca gggccatgct gaggcctggg 



tgggggcccc ccccacccaa gacggcaaca 300 
ctcctgtgtt tactgagcat gtactgagtg 360 
gctccaaagg' gcaatgctga ggagaggaat 420 
atgcttggag gtgaggtgaa ggctgtgagc 48 0 
gcaggctcat attcactagg aatagcttta 54 0 
gaaggttatt tgactcctga gcctctattt 600 
acctgataag cttgtggagc tgtaaggcag 660 
tccaagctcc cttccttact tcccctttcc 72 0 
ctggaggtgg tcagggaagc ttcacagagg 780 
gggtgagaga gctaaggaag gaggctgagg 840 
ctgagagctg caaggtagga tatctgttgt 900 
tttttttcag ggagggtggg gccagagaag 960 
agtagttatg cccctaaggg atgcccaccc 102 0 
ggtggcctag gcacctgtct cgtggctcca 1080 
aaggacagct ctccttgtcc attttccaag 1140 
ctagcctccc tcatagcctg agcttggcca 1200 
acccaacaca ctctgcttct ggtcctcccc 12 60 
tggtcctgca ggtgctttgc aagatatcac 1320 
ggacacgcag ctcctgacgg ctattcccac 13 8 0 
agctgcctcc acctccaccc tgccggctgg 14 40 
cctgccagaa gtggagcctg gcctcaccgc 1500 
ggagaccaca cagctcccga ccactcatca 1560 
ggagcccgcc acctcccacc cccacaggga 1620 
ccctgcagga cccagccaag ctgaccttca 1680 
tgccaccgag agggctgctg aggatggagc 17 40 
tggggagcag gtgagtggcc tctgcattcc 1800 
ctggcacttg gcaggcccta cacctgtgcc 18 60 
acagggcaca ggggccgcct tcccctaccc 1920 
tatgagatca gagcagaagc acccttaaag 1980 
atccaaactt gcagccctcg tcgaagagtg 2040 
tcgtggattt acgcttacac taaatagtct 2100 
actttttgtg atatccatgc catggtccag 2160 
cctggctttc gggccctgtg cgatcctggt 2220 
atcagggagg caggagctgc ttctgggtct 2280 
gacctggcca acccaatctg cccctgccct 2340 
ggagaatacg gctgtagtgg ccgtggagcc 2400 
gggggccacg ggggcctcac agggcctcct 24 60 
ttctttcagg ggggtagttt ggggtgaatt 2520 
cagccaaggc cactgctttg ggagggtctg 2580 
ctggccccgt ctacctgccc taggggtcat 2640 
tgtgtgcctg gtgggtttca tgctgtaccg 2700 
cttggaggag ccgaaacaag ccaacggcgg 27 60 
attctatgcc tgacgcggga gccatgcgcc 2820 
cttgcctctt ccttgaagaa ctgcaggccc 2880 
agcattccag cccctctggt cgctcctgcc 2940 
ctctgcttct ctgacttctg cctggagact 3000 
tttccaccac agccagcacc tggcatcgca 3060 
cagcctctcc ccaggtccag ctctggaggg 3120 
ggcctcatgt ggctggaaga tcctgcgggt 3180 
tactggtagg accaagcatc ttgggggggt 3240 
tttgtttcgt ggggaggtct aatctagata 3300 
ttctttgttc atagcccagt agaccttgtt 3360 
ggtatccccc catcttgctt ccctaatcta 342 0 
actttttttt ttttttttaa actaggagaa 3480 
tttgtgtgtt gtctcttgag tttgtcgctc 3540 
tggtggcccc gttctggtgg tctgttggca 3600 
ccgcctcttt caagcagtcg tgcctgtgtc 3660 
ccgctgccac gttggagaag cccgtgtgag 3720 
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aagtgaatgc tgggactcag ccttcagaca gagaggactg tagggagggc ggcaggggcc 3780 
tggagatcct cctgcaggct cacgcccgtc ctcctgtggc gccgtctcca ggggctgctt 38 40 
cctcctggaa attgacgagg ggtgtcttgg gcagagctgg ctctgagcgc ctccatccaa 3900 
ggccaggttc tccgttagct cctgtggccc caccctgggc cctgggctgg aatcaggaat 3960 
attttccaaa gagtgatagt cttttgcttt tggcaaaact ctacttaatc caatgggttt 4 02 0 
ttccctgtac agtagatttt ccaaatgtaa taaactttaa tataaagtag tctgtgaatg 4080 
ccactgcctt cgcttcttgc ctctgtgctg tgtgtgacgt gaccggactt ttctgcaaac 4140 
accaacatgt tgggaaactt ggctcgaatc tctgtgcctt cgtctttccc atggggaggg 42 00 
attctggttc cagggtccct ctgtgtattt gcttttttgt tttggctgaa attctcctgg 42 60 
aggtcggtag gttcagccaa ggttttataa ggctgatgtc aatttctgtg ttgccaagct 432 0 
ccaagcccat cttctaaatg gcaaaggaag gtggatggcc ccagcacagc ttgacctgag 4380 
gctgtggtca cagcggaggt gtggagccga ggcctacccc ncagacacct tggacatcct 4440 
cctcccaccc ggctgcagag gccaganncc agcccagggt cctgcactta cttgcttatt 4500 
tgacaacgtt tcagcgactc cgttggccac tccgagagtg ggccagtctg tggatcagag 4560 
atgcaccacc aagccaaggg aacctgtgtc cggtattcga tactgcgact ttctgcctgg 4 62 0 
agtgtatgac tgcacatgac tcgggggtgg ggaaaggggt cggctgacca tgctcatctg 4 680 
ctggtccgtg ggacggtncc caagccagag gtgggttcat ttgtgtaacg acaataaacg 4740 
gtacttgtca tttcgggcaa cggctgctgt ggtggtggtt gagtctcttc ttggcct 4797 

<210> 135 
<211> 2856 
<212> DNA 

<213> Homo sapiens 
<400> 135 

tagtcgcggg tccccgagtg agcacgccag ggagcaggag accaaacgac gggggtcgga 60 
gtcagagtcg cagtgggagt ccccggaccg gagcacgagc ctgagcggga gagcgccgct 120 
cgcacgcccg tcgccacccg cgtacccggc gcagccagag ccaccagcgc agcgctgcca 18 0 
tggagcccag cagcaagaag ctgacgggtc gcctcatgct ggctgtggga ggagcagtgc 240 
ttggctccct gcagtttggc tacaacactg gagtcatcaa tgccccccag aaggtgatcg 300 
aggagttcta caaccagaca tgggtccacc gctatgggga gagcatcctg cccaccacgc 360 
tcaccacgct ctggtccctc tcagtggcca tcttttctgt tgggggcatg attggctcct 42 0 
tctctgtggg ccttttcgtt aaccgctttg gccggcggaa ttcaatgctg atgatgaacc 480 
tgctggcctt cgtgtccgcc gtgctcatgg gcttctcgaa actgggcaag tcctttgaga 540 
tgctgatcct gggccgcttc atcatcggtg tgtactgcgg cctgaccaca ggcttcgtgc 600 
ccatgtatgt gggtgaagtg tcacccacag cctttcgtgg ggccctgggc accctgcacc 660 
agctgggcat cgtcgtcggc atcctcatcg cccaggtgtt cggcctggac tccatcatgg 720 
gcaacaagga cctgtggccc ctgctgctga gcatcatctt catcccggcc ctgctgcagt 780 
gcatcgtgct gcccttctgc cccgagagtc cccgcttcct gctcatcaac cgcaacgagg 840 
agaaccgggc caagagtgtg ctaaagaagc tgcgcgggac agctgacgtg acccatgacc 900 
tgcaggagat gaaggaagag agtcggcaga tgatgcggga gaagaaggtc accatcctgg 960 
agctgttccg ctcccccgcc taccgccagc ccatcctcat cgctgtggtg ctgcagctgt 1020 
cccagcagct gtctggcatc aacgctgtct tctattactc cacgagcatc ttcgagaagg 1080 
cgggggtgca gcagcctgtg tatgccacca ttggctccgg tatcgtcaac acggccttca 1140 
ctgtcgtgtc gctgtttgtg gtggagcgag caggccggcg gaccctgcac ctcataggcc 12 00 
tcgctggcat ggcgggttgt gccatactca tgaccatcgc gctagcactg ctggagcagc 12 60 
taccctggat gtcctatctg agcatcgtgg ccatctttgg ctttgtggcc ttctttgaag 1320 
tgggtcctgg ccccatccca tggttcatcg tggctgaact cttcagccag ggtccacgtc 1380 
cagctgccat tgccgttgca ggcttctcca actggacctc aaatttcatt gtgggcatgt 1440 
gcttccagta tgtggagcaa ctgtgtggtc cctacgtctt catcatcttc actgtgctcc 1500 
tggttctgtt cttcatcttc acctacttca aagttcctga gactaaaggc cggaccttcg 1560 
atgagatcgc ttccggcttc cggcaggggg gagccagcca aagtgataag acacccgagg 1620 
agctgttcca tcccctgggg gctgattccc aagtgtgagt cgccccagat caccagcccg 1680 
gcctgctccc agcagcccta aggatctctc aggagcacag gcagctggat gagacttcca 1740 
aacctgacag atgtcagccg agccgggcct ggggctcctt tctccagcca gcaatgatgt 1800 
ccagaagaat attcaggact taacggctcc aggattttaa caaaagcaag actgttgctc 18 60 
aaatctattc agacaagcaa caggttttat aattttttta ttactgattt tgttattttt 1920 
atatcagcct gagtctcctg tgcccacatc ccaggcttca ccctgaatgg ttccatgcct 1980 
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gagggtggag actaagccct gtcgagacac 
ctggacctat gtcctaagga cacactaatc 
gaggtggcta tggccacccg ttctgctggc 
cattaggatt tgccccttcc catctcttcc 
cctgagacca gttgggagca ctggagtgca 
gccgggttct agtctccttt gcactgaggg 
gggagcctgc aaactcactg ctcaagaaga 
tgcaagatat ttatatatat ttttggttgt 
atatctggac aagccaactt gtaaatacac 
tataaatggc tggtttttag aaacatggtt 
tttggatggg agtgagacag aagtaagtgg 
gactcaggat ccagtccctt acacgtacct 
tttgatccct gttacccaga gaatatatac 
atcacatatt tgatagttgg tgttcaaaaa 
aggcttgaaa tcgcattatt ttgaatgtga 

<210> 136 
<211> 356 
<212> DNA 
<213> Homo sapiens 



ttgccttctt cacccagcta atctgtaggg 2 04 0 
gaactatgaa ctacaaagct tctatcccag 2100 
ctggatctcc ccactctagg ggtcaggctc 2160 
tacccaacca ctcaaattaa tctttcttta 2220 
gggaggagag gggaagggcc agtctgggct 2280 
ccacactatt accatgagaa gagggcctgt 2340 
catggagact cctgccctgt tgtgtataga 24 00 
caatattaaa tacagacact aagttatagt 24 60 
cacctcactc ctgttactta cctaaacaga 2520 
ttgaaatgct tgtggattga gggtaggagg 2580 
ggttgcaacc actgcaacgg cttagacttc 2 64 0 
ctcatcagtg tcctcttgct caaaaatctg 27 00 
attctttatc ttgacattca aggcatttct 27 60 
aacactagtt ttgtgccagc cgtgatgctc 2820 
agggaa 2 85 6 



<400> 136 

ggtggagcca aatgaagaaa atgaagatga 
aggcattgat gatgatgaag attttatctc 
tgaccacaca aaacagaacc aggactggac 
agtgctactt cagacaacca caaggatgac 
tgaaggaaac tggaacccag aagcacaccc 
agaagagacc ccacattcta caagcacaat 



aagagacaga cacctcagtt tttctggatc 60 
cagcaccatt tcaaccacac cacgggcttt 12 0 
tcagtggaac ccaagccatt caaatccgga 18 0 
tgatgtagac agaaatggca ccactgctta 240 
tcccctcatt caccatgagc atcatgagga 300 
ccaggcaact cctagtagta caacgg 356 



<210> 137 
<211> 356 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 254, 264, 279, 281, 290, 328, 342 
<223> n = A,T,C or G 



<400> 137 

gcaggtggag aagacatttt attgttcctg gggtctctgg aggcccattg gtggggctgg 60 
gtcactggct gcccccggaa cagggcgctg ctccatggct ctgcttgtgg tagtctgtgg 12 0 
ctatgtctcc cagcaaggac agaaactcag aaaaatcaat cttcttatcc tcattcttgt 180 
cctttttctc aaagacatcg gcgaggtaat ttgtgccctt tttacctcgg cccgcgacca 240 
cgctaaggcc aaanttccag acanayggcc gggccggtnc nataggggan cccaacttgg 300 
ggacccaaac tctggcgcgg aaacacangg gcataagctt gnttcctgtg gggaaa 356 

<210> 138 
<211> 353 
<212> DNA 
<213> Homo sapiens 



<400> 138 

aggtccagtc ctccacttgg cctgatgaga gtggggagtg gcaagggacg tttctcctgc 60 
aatagacact tagatttctc tcttgtggga agaaaccacc tgtccatcca ctgactcttc 120 
tacattgatg tggaaattgc tgctgctacc accacctcct gaagaggctt ccctgatgcc 180 
aatgccagcc atcttggcat cctggccctc gagcaggctg cggtaagtag cgatctcctg 240 
ctccagccgt gtctttatgt caagcagcat cttgtactcc tggttctgag cctccatctc 300 
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gcatcggagc 


tcactcagac 


ctcgsccgsg 


mssmcgctam 


gccgaattcc 


age 


353 


<210> 139 














<211> 371 














<212> DNA 














<213> Homo 


sapiens 












<400> 139 














agcgtggtcg 


cggccgaggt 










60 


agacatattc 


tacacttcaa 


agctttggtg 


caattcccat 


cgaccagagt 


tggtccgacc 


120 


agccttggaa 


aggtcactga 


aaaatcttca 


attggattat 


gttgacctct 


accttattca 


180 


ttttccagtg 


tctgtaaagc 


caggtgagga 


agtgatccca 


aaagatgaaa 


atggaaaaat 


240 


actatttgac 


acagtggatc 


tctgtgccac 


gtgggaggcc 


gtggagaagt 


gtaaagatgc 


300 


aggattggac 


ctgcccgggc 


ggccgctcga 


aagccgaatt 


ccagcacact 


ggcggccgtt 




actagtggat 


c 










371 


<210> 140 














<211> 370 














<212> DNA 














<213> Homo 


sapiens 












<400> 140 














tagcgtggtc 


gcggccgagg 


tccatctccc 


tttgggaact 


agggggctgc 


tggtgggaaa 




tgggagccag 


ggcagatgtt 


gcattccttt 


gtgtccctgt 


aaatgtggga 


ctacaagaag 


120 


aggagctgcc 


tgagtggtac 


tttctcttcc 


tggtaatcct 


ctggcccagc 


ctcatggcag 


180 


aatagaggta 


tttttaggct 


atttttgtaa 


tatggcttct 


ggtcaaaatc 


cctgtgtagc 


240 


tgaattccca 


agccctgcat 


tgtacagccc 


cccactcccc 


tcaccaccta 


ataaaggaat 


300 


agttaacact 


caaaaaaaaa 


aaaaaacctg 


cccgggcggc 


cgctcgaaag 


ccgaattcca 




gcacactggc 












370 


<210> 141 














<211> 371 














<212> DNA 














<213> Homo 


sapiens 












<400> 141 














tagcgtggtc 












60 


gggtgtaggc 


agtgcaggag 


ccctcatcca 


gtggcaggga 


acaggggtca 


tcactatccc 


120 


aaggagcttc 


agggtcctgg 


tactcctcca 


cagaatactc 


ggagtattca 


gagtactcat 


180 


catcctcagg 


gggtacccgc 


tcttcctcct 


ctgcatgaga 


gacgcggagc 


acaggcacag 


240 


catggagctg 


ggagccggca 


gtgtctgcag 


cataactagg 


gaggggtcgt 


gatccagatg 


300 


cgatgaactg 


gccctggcag 


gcacagtgct 


gactcatctc 


ttggcgacct 


gcccgggcgg 




ccgctcgaag 


c 










371 


<210> 142 














<211> 343 














<212> DNA 














<213> Homo 


sapiens 












<400> 142 














gcgttttgag 


gccaatggtg 


taaaaggaaa 


tatcttcaca 


taaaaactag 


atggaagcat 


60 


tgtcagaaac 


ctctttgtga 


tgtttgcttt 


caactcacag 


agttgaacat 


tccttttcat 


120 


agagcagttt 


tgaaacactc 


ttttgtagaa 


tttgcaagcg 


gatgattgga 


tegctatgag 


180 


gtcttcattg 


gaaacgggat 


acctttacat 


aaaaactaga 


cagtagcatt 


ctcagaaatt 


240 


tctttgggat 


gtgggcattc 


aacccacaga 


ggagaacttc 


atttgataga 


gcagttttga 


300 


aacacccttt 


ttgtagaatc 


tacaggtgga 


catttagagt 


get 




343 



<210> 143 
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<211> 354 
<212> DNA 
<213> Homo sapiens 



<400> 143 

aggtctgatg gcagaaaaac tcagactgtc 
catcaggagt gggatgggaa ggaaagcaca 
gtggtggagt gtgtcatgaa caatgtcacc 
aaattccatc atcactttgg acaggagtta 
agcaaatctc catactgttt ctttcttttt 
cataaacatt ttacatgcag ctatttcaaa 



tgcaacttta cagatggtgc attggttcag 60 

ataacaagaa aattgaaaga tgggaaatta 12 0 

tgtactcgga tctatgaaaa agtagaataa 180 

attaagagaa tgaccaagct cagttcaatg 24 0 

tttttcatta ctgtgttcaa ttatctttat 300 

gtgtgttgga ttaattagga teat 354 



<210> 144 
<211> 353 
<212> DNA 
<213> Homo sapiens 



<400> 144 

ggtcaaggac ctgggggacc cccaggtcca 
cctagagcac atctggatct cagccccacc 
aagatgacag actaagtagg attctgecat 
gttaagttgc ttaactttca ttctgtctta 
gaaaccatgc cccagagaag gttaagtgac 
aggtttgect gataccagac ctgtggcccc 



gcagccacat gattctgeag cagacaggga 60 

cctggcaacc tgcctgccta gagaactccc 12 0 

ttagaataat tctggtatcc tgggcgttgc 180 

cgatagtctt cagaggtggg aacagatgaa 240 

ttcctcttta tggagccagt gttccaacct 300 

acctcccatg caggtctctg tgg 353 



<210> 145 
<211> 371 
<212> DNA 
<213> Homo sapiens 



<400> 145 

caggtctgtc ataaactggt ctggagtttc 
ttcctgagac ttgctggcct etcegttgag 
attgecactg ttgatcacta gctttttctt 
aatgeaaact gcaagaatca aagecaagge 
tggaatttgg ggtgtcctta taggaccaga 
atgtgagacc tcggccgcga ccacgctaag 
tagtggatcc g 



tgacgactcc ttgttcacca aatgeaccat 60 
tccacttggc tttctgtcct ccacagctcc 120 
ctgcccacac cttcttcgac tgttgactgc 18 0 
caagagggat gecaagatga tcagccattc 24 0 
ggttgtgttt gctccacctt cttgactccc 300 
ccgaattcca gcacactggc ggcccgttac 360 
371 



<210> 146 
<211> 355 
<212> DNA 
<213> Homo sapiens 



<400> 146 

ggtcctccgt cctcttccca gaggtgtcgg 
caggatggcg agtagcagcg gctccaaggc 
ggtacggaag ategggtctg gctccttcgg 
eggegaggaa gtggcagtga agctagaatc 
cgagagcaag ctctataaga ttcttcaagg 
tggtcaggaa aaagactaca atgtactagt 



ggcttggccc cagcctccat cttcgtctct 60 

tgaattcatt gtcggaggga aatataaact 120 

ggacatctat ttggcgatca acatcaccaa 18 0 

teagaaggee aggcatcccc agttgctgta 24 0 

tggggttggc atcccccaca tacggtggta 300 

catggatctt ctgggaccta geetc 355 



<210> 147 
<211> 355 
<212> DNA 
<213> Homo sapiens 



<400> 147 



WO 02/47534 



68 



PCT7US01/47576 



ggtctgttac aaaatgaaga cagacaacac 
tactatgcac gtgctgtgat tttgaacata 
tgacttttta ggttggctga tccatcaatc 
ttgttaggag caaagctgac ctgaacagca 
tttttcccat aatatgggaa atattttaag 
acatttggta tatcttcatt ctttgaaaca 

<210> 148 
<211> 369 
<212> DNA 
<213> Homo sapiens 



aacatttact ctgtggagat atcctactca 60 
actcgtccca aaaacttgtc acgatcatcc 120 
ttgcactcaa ctgttacttc tttcccagtg 18 0 
accaatggct gtagataccc aacatgcagt 24 0 
tctatcattc cattatgagg ataaactgct 300 
caatctatcc ttggcactcc ttcag 355 



<400> 148 

aggtctctct ccccctctcc ctctcctgcc 
caccttcctt catgatgtgg gaagagtgct 
agggagtgtg ccgagggctt ctgagaaggt 
atgtggcagc ccctcttctt caagtggctc 
gctgcagcag cctccatcca gcctgaggat 
gaaaagatga gagaagttac agactctcct 
acttcttca 



agccaagtga agacatgctt acttcccctt 60 
gcaacccagc cctagccaac accgcatgag 12 0 
ttctctcaca tctagaaaga agcgcttaag 180 
ttgtcctgtt gccctgggag ttctcaaatt 240 
gacatcaata cacagaggaa gaagagtcag 300 
gggcgacccc gagagcttac cattcctcag 360 
369 



<210> 149 
<211> 620 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 169, 171, 222, 472, 528, 559, 599 
<223> n = A,T,C or G 



<400> 149 

actagtcaaa aatgctaaaa taatttggga 
catgtttatc ttttattatg ttttgtgaag 
gccaatattt ccttatatct atccataaca 
gtgaaactta acactttata aggtaaaaat 
gttcttgtta tttccaaata gaatggactt 
ataaggttaa aagttgttaa tgaccaaaca 
tttcaagcct tcgaactatt taaggaaagc 
gagaatttct cattaatatc ctgaatcatt 
atgtctctaa gaaagtacta tttcatggtc 
tttcccttaa gtgtgaaant atttaaaatg 
agggttaagg gtgttgggga 

<210> 150 
<211> 371 
<212> DNA 
<213> Homo sapiens 



gaaaatattt tttaagtagt gttatagttt 60 
ttgtgtcttt tcactaatta cctatactat 120 
tttatactac atttgtaana naatatgcac 180 
gaggtttcca anatttaata atctgatcaa 240 
ggtctgttaa gggctaagga gaagaggaag 300 
ttctaaaaga aatgcaaaaa aaaagtttat 360 
aaaatcattt cctaaatgca tatcatttgt 420 
catttcacta aggctcatgt tnactccgat 4 80 
caaacctggt tgccatantt gggtaaaggc 540 
aaattttcct ctttttaaaa attctttana 600 
620 



<400> 150 

ggtccgatca aaacctgcta cctccccaag 
gagcaaccag tatcacttcc ctgtttataa 
atgctgaaaa ccacctggtc tgcatgtatg 
aaaatttaat tttagggatt catttctata 
atatgtgtaa ggtgaaattt atggtatttg 
tcatttttcc cccagtgaat gatttagaat 
ttacttttat a 



actttactag tgccgataaa ctttctcaaa 60 
aacctctaac catctctttg ttctttgaac 12 0 
cccgaatttg yaattctttt ctctcaaatg 18 0 
ttttcacata tgtagtatta ttatttcctt 240 
agtgtgcaag aaaatatatt tttaaagctt 300 
tttttatgta aatatacaga atgttttttc 360 
371 
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<210> 151 

<211> 4655 

<212> DNA 

<213> Homo sapiens 

<400> 151 

gggacttgag ttctgttatc 
gggttggcaa aatcctggag 
acatgttgta cctggaaaac 
tggggctcct gaacagcatg 
ataacacaga ccacgcgcag 
ccaccttcga tgctctctct 
cgcacagttt cgacgtgtcc 
attccactga actgaagaaa 
aggtgatgac cccacctcct 
ctgagcacgt cacggaggtg 
acgagggaca gattgcccct 
agtatgtaga agatcccatc 
aggttggcac tgaattcacg 
gagggatgaa ccgccgtcca 
tcctgggccg acgctgcttt 
cggatgaaga tagcatcaga 
cgaagcgccc gtttcgtcag 
gatccccaga tgatgaactg 
tggtgaagat caaagagtcc 
cgtacaggca acagcaacag 
agtctccatc ttcatatggt 
agctgccttc tgtgagccag 
ccattcctga tggcatggga 
gagacatgaa tggactcagc 
cctcccactg cacaccccca 
cgaggttggg ctgttcatca 
atcagattga gcattactcc 
gacatgcgat ctggaagggc 
ctcatctcct gcggacccca 
ggggtgagcg tgttattgat 
cccgagatga gtggaatgac 
gcatcaaaga ggagggggag 
tgccagcccc ctaaaagcac 
ttcctcttgt ctgatttctt 
atctgacctg gcatctaatt 
actgtagctt gccatggcta 
tgcagagatt tctcattgac 
atataaatgt ataaatatac 
aatgtaattt aaatgaaaga 
ttttggatgg cttgtctata 
tagagcttaa tgctacatgt 
ctaaatacat gccacatcaa 
aagactgtag atatgtattc 
tctagtgatg atggttcacg 
caaacgtcct ctttagtttt 
ccagttcaaa aacacccgac 
taccagatac cttatcttac 
ttaaaactaa atttcactac 
gaattctgat tgatttgatt 
actgtctatt aatattcagg 
cagtaagata tctcaatgaa 
aatattgttt ggtaaatgtt 



ttcttaagta gattcatatt gtaagggtct cggggtgggg 60 
ccagaagaaa ggacagcagc attgatcaat cttacagcta 120 
aatgcccaga ctcaatttag tgagccacag tacacgaacc 180 
gaccagcaga ttcagaacgg ctcctcgtcc accagtccct 240 
aacagcgtca cggcgccctc gccctacgca cagcccagct 300 
ccatcacccg ccatcccctc caacaccgac tacccaggcc 360 
ttccagcagt cgagcaccgc caagtcggcc acctggacgt 420 
ctctactgcc aaattgcaaa gacatgcccc atccagatca 48 0 
cagggagctg ttatccgcgc catgcctgtc tacaaaaaag 54 0 
gtgaagcggt gccccaacca tgagctgagc cgtgaattca 600 
yctagtcatt tgattcgagt agaggggaac agccatgccc 660 
acaggaagac agagtgtgct ggtaccttat gagccacccc 72 0 
acagtcttgt acaatttcat gtgtaacagc agttgtgttg 780 
attttaatca ttgttactct ggaaaccaga gatgggcaag 84 0 
gaggcccgga tctgtgcttg cccaggaaga gacaggaagg 900 
aagcagcaag tttcggacag tacaaagaac ggtgatggta 960 
aacacacatg gtatccagat gacatccatc aagaaacgaa 1020 
gtatacttac cagtgagggg ccgtgagact tatgaaatgc 1080 
ctggaactca tgcagtacct tcttcagcac acaattgaaa 1140 
cagcagcacc agcacttact tcagaaacag acctcaatac 1200 
aacagctccc cacctctgaa caaaatgaac agcatgaaca 12 60 
cttatcaacc ctcagcagcg caacgccctc actcctacaa 1320 
gccaacattc ccatgatggg cacccacatg ccaatggctg 1380 
cccacccagg cactccctcc cccactctcc atgccatcca 1440 
cctccgtatc ccacagattg cagcattgtc agtttcttag 1500 
tgtctggact atttcacgac ccaggggctg accaccatct 1560 
atggatgatc tggcaagtct gaaaatccct gagcaatttc 1620 
atcctggacc accggcagct ccacgaattc tcctcccctt 1680 
agcagtgcct ctacagtcag tgtgggctcc agtgagaccc 17 4 0 
gctgtgcgat tcaccctccg ccagaccatc tctttcccac 18 00 
ttcaactttg acatggatgc tcgccgcaat aagcaacagc 18 60 
tgagcctcac catgtgagct cttcctatcc ctctcctaac 1920 
tcctgcttaa tcttcaaagc cttctcccta gctcctcccc 1980 
aggggaagga gaagtaagag gcttacttct taccctaacc 2040 
ctgattctgg ctttaagcct tcaaaactat agcttgcaga -2100 
ggtagaagtg agcaaaaaag agttgggtgt ctccttaagc 2160 
ttttataaag catgttcacc cttatagtct aagactatat 222 0 
agtatagatt tttgggtggg gggcattgag tattgtttaa 2280 
aaattgagtt gcacttattg accatttttt aatttacttg 2340 
ctccttccct taaggggtat catgtatggt gataggtatc 24 00 
gagtgacgat gatgtacaga ttctttcagt tctttggatt 24 60 
acctttgagt agatccattt ccattgctta ttatgtaggt 2520 
ttttctcagt gttggtatat tttatattac tgacatttct 2580 
ttggggtgat ttaatccagt tataagaaga agttcatgtc 2 640 
tggttgggaa tgaggaaaat tcttaaaagg cccatagcag 2700 
gtcatgtatt tgagcatatc agtaaccccc ttaaatttaa 27 60 
aatattgatt gggaaaacat ttgctgccat tacagaggta 2 820 
tagattgact aactcaaata cacatttgct actgttgtaa 2880 
gggatgaatg ccatctatct agttctaaca gtgaagtttt 2 940 
gtaaatagga atcattcaga aatgttgagt ctgtactaaa 3000 
ccataaattc aactttgtaa aaatcttttg aagcatagat 3060 
tcttttgttt ggtaaatgtt tcytttaaag accctcctat 3120 
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tctataaaac tctgcatgta gaggcttgtt tacctttctc tctctaaggt ttacaatagg 3180 
agtggtgatt tgaaaaatat aaaattatga gattggtttt cctgtggcat aaattgcatc 3240 
actgtatcat tttctttttt aaccggtaag agtttcagtt tgttggaaag taactgtgag 33 00 
aacccagttt cccgtccatc tcccttaggg actacccata gacatgaaag gtccccacag 33 60 
agcaagagat aagtctttca tggctgctgt tgcttaaacc acttaaacga agagttccct 3420 
tgaaactttg ggaaaacatg ttaatgacaa tattccagat ctttcagaaa tataacacat 3480 
ttttttgcat gcatgcaaat gagctctgaa atcttcccat gcattctggt caagggctgt 3540 
cattgcacat aagcttccat tttaatttta aagtgcaaaa gggccagcgt ggctctaaaa 3600 
ggtaatgtgt ggattgcctc tgaaaagtgt gtatatattt tgtgtgaaat tgcatacttt 3660 
gtattttgat tatttttttt ttcttcttgg gatagtggga tttccagaac cacacttgaa 3720 
accttttttt atcgtttttg tattttcatg aaaataccat ttagtaagaa taccacatca 3780 
aataagaaat aatgctacaa ttttaagagg ggagggaagg gaaagttttt ttttttatta 3840 
tttttttaaa attttgtatg ttaaagagaa tgagtccttg atttcaaagt tttgttgtac 3900 
ttaaatggta ataagcactg taaacttctg caacaagcat gcagctttgc aaacccatta 3960 
aggggaagaa tgaaagctgt tccttggtcc tagtaagaag acaaactgct tcccttactt 4020 
tgctgagggt ttgaataaac ctaggacttc cgagctatgt cagtactatt caggtaacac 4080 
tagggccttg gaaatccctg tactgtgtct catggatttg gcactagcca aagcgaggca 4140 
ccccttactg gcttacctcc tcatggcagc ctactctcct tgagtgtatg agtagccagg 4200 
gtaaggggta aaaggatagt aagcatagaa accactagaa agtgggctta atggagttct 42 60 
tgtggcctca gctcaatgca gttagctgaa gaattgaaaa gtttttgttt ggagacgttt 4320 
ataaacagaa atggaaagca' gagttttcat taaatccttt tacctttttt ttttcttggt 4380 
aatcccctaa aataacagta tgtgggatat tgaatgttaa agggatattt ttttctatta 4440 
tttttataat tgtacaaaat taagcaaatg ttaaaagttt tatatgcttt attaatgttt 4500 
tcaaaaggta ttatacatgt gatacatttt ttaagcttca gttgcttgtc ttctggtact 45 60 
ttctgttatg ggcttttggg gagccagaag ccaatctaca atctcttttt gtttgccagg 4620 
acatgcaata aaatttaaaa aataaataaa aacta 4 655 

<210> 152 

<211> 586 

<212> PRT 

<213> Homo sapiens 

<400> 152 



Met 


Leu 


Tyr 


Leu 


Glu 


Asn 


Asn 


Ala 


Gin 


Thr Gin Phe Ser Glu Pro 


Gin 


1 








5 










10 15 




Tyr 


Thr 


Asn 


Leu 


Gly 


Leu 


Leu 


Asn 


Ser 


Met Asp Gin Gin He Gin 


Asn 








20 










25 


30 




Gly 


Ser 


Ser 


Ser 


Thr 


Ser 


Pro 


Tyr 


Asn 


Thr Asp His Ala Gin Asn 


Ser 






35 










40 




45 




Val 


Thr 


Ala 


Pro 


Ser 


Pro 


Tyr 


Ala 


Gin 


Pro Ser Ser Thr Phe Asp Ala 




50 










55 






60 




Leu 


Ser 


Pro 


Ser 


Pro 


Ala 


He 


Pro 


Ser 


Asn Thr Asp Tyr Pro Gly 


Pro 


65 










70 








75 


80 


His 


Ser 


Phe 


Asp 


Val 
85 


Ser 


Phe 


Gin 


Gin 


Ser Ser Thr Ala Lys Ser 
90 95 


Ala 


Thr 


Trp 


Thr 


Tyr 


Ser 


Thr 


Glu 




Lys 


Lys Leu Tyr Cys Gin He 


Ala 








100 










105 


110 




Lys 


Thr 


Cys 


Pro 


He 


Gin 


He 


Lys 


Val 


Met Thr Pro Pro Pro Gin' Gly 






115 










120 




125 




Ala 


Val 


He 


Arg 


Ala 


Met 


Pro 


Val 


Tyr 


Lys Lys Ala Glu His Val 


Thr 




130 










135 






140 




Glu 


Val 


Val 


Lys 


Arg 


Cys 


Pro 


Asn 


His 


Glu Leu Ser Arg Glu Phe 


Asn 


145 










150 








155 


160 


Glu 


Gly 


Gin 


He 


Ala 


Pro 


Ser 


Ser 


His 


Leu He Arg Val Glu Gly 


Asn 










165 










170 175 




Ser 


His 


Ala 


Gin 


Tyr 


Val 


Glu 


Asp 


Pro 


He Thr Gly Arg Gin Ser 


Val 








180 










185 


190 




Leu 


Val 


Pro 


Tyr 


Glu 


Pro 


Pro 


Gin 


Val 


Gly Thr Glu Phe Thr Thr 


Val 
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195 200 205 



Leu 


Tyr Asn 


Phe 


Met Cys 


Asn 


Ser 


Ser Cys 


Val Gly Gly Met Asn Arg 




210 








215 






220 






Arg 


Pro 


lie 


Leu 


He lie 


Val 


Thr 


Leu Glu 


Thr Arg Asp 


Gly Gin Val 


225 








230 








235 




240 


Leu 


Gly 


Arg 


Arg 


Cys Phe 


Glu 


Ala 


Arg He 


Cys Ala 


Cys 


Pro Gly Arg 










245 






250 






255 


Asp 


Arg 


Lys 


Ala 


Asp Glu 


Asp 


Ser 


He Arg 


Lys Gin 


Gin 


Val Ser Asp 








260 








265 






270 


Ser 


Thr 


Lys 


Asn 


Gly Asp 


Gly 


Thr 


Lys Arg 


Pro Phe 


Arg 


Gin Asn Thr 






275 








280 






285 




His 


Gly 


He 


Gin 


Met Thr 


Ser 


He 


Lys Lys 


Arg Arg 


Ser 


Pro Asp Asp 




290 








295 






300 




Glu 


Leu 


Val 


Tyr 


Leu Pro 


Val Arg 


Gly Arg 


Glu Thr 


Tyr 


Glu Met Leu 


305 








310 








315 




320 


Val 


Lys 


He 


Lys 


Glu Ser 


Leu 


Glu 


Leu Met 


Gin Tyr 


Leu 


Leu Gin His 










325 






330 






335 


Thr 


He 


Glu 


Thr 


Tyr Arg 


Gin 


Gin 


Gin Gin 


Gin Gin 


His 


Gin His Leu 








340 








345 






350 


Leu 


Gin 


Lys 


Gin 


Thr Ser 


He 


Gin 


Ser Pro 


Ser Ser 


Tyr 


Gly Asn Ser 






355 








360 






365 


Ser 


Pro 


Pro 


Leu 


Asn Lys 


Met 


Asn 


Ser Met 


Asn Lys 


Leu 


Pro Ser Val 




370 








375 






380 






Ser 


Gin 


Leu 


He 


Asn Pro 


Gin 


Gin Arg Asn 


Ala Leu 


Thr 


Pro Thr Thr 


385 








390 








395 




400 


lie 


Pro Asp Gly Met Gly 


Ala 


Asn 


He Pro 


Met Met 


Gly Thr His Met 










405 






410 






415 


Pro Met Ala Gly Asp Met 


Asn 


Gly Leu Ser 


Pro Thr 


Gin 


Ala Leu Pro 








420 








425 






430 


Pro 


Pro 


Leu 


Ser 


Met Pro 


Ser 


Thr 


Ser His 


Cys Thr 


Pro 


Pro Pro Pro 






435 








440 






4 45 




Tyr 


Pro 


Thr Asp 


Cys Ser 


He 


Val 


Ser Phe 


Leu Ala Arg Leu Gly Cys 




450 








455 






460 






Ser 


Sei- 


Cys 


Leu 


Asp Tyr 


Phe 


Thr 


Thr Gin 


Gly Leu 


Thr 


Thr He Tyr 


465 








470 








475 




480 


Gin 


Ile 


Glu 


His 


Tyr Ser 


Met Asp Asp Leu 


Ala Ser 


Leu 


Lys He Pro 










485 






490 






495 


Glu Gin Phe Arg His Ala 


He 


Trp 


Lys Gly 


He Leu 


Asp His Arg Gin 








500 








505 






510 


Leu 


His 


Glu 


Phe 


Ser Ser 


Pro 


Ser 


His Leu 


Leu Arg 


Thr 


Pro Ser Ser 






515 








520 






525 




Ala 


Ser 


Thr 


Val 


Ser Val 


Gly 


Ser 


Ser Glu 


Thr Arg Gly Glu Arg Val 




530 








535 






540 






lie 


Asp Ala Val 


Arg Phe 


Thr 


Leu Arg Gin 


Thr He 


Ser 


Phe Pro Pro 


545 








550 








555 




560 


Arg Asp Glu Trp Asn Asp 


Phe 


Asn 


Phe Asp 


Met Asp 


Ala 


Arg Arg Asn 










565 






570 






575 


Lys 


Gin 


Gin Arg 


He Lys 


Glu Glu Gly Glu 









580 585 



<210> 153 
<2H> 2007 
<212> DNA 
<213> Homo sapiens 

<400> 153 

gaattcgtcg ctgctccagg gaaagttctg ttactccact gactctctct tttcctgata 60 
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acatggccag caagaaagta attacagtgt 
tggccagggc aattttggag agcaaaaaat 
cttgaccaaa tgccctggag ctccagcgcc 
atgataaagc atcggtggac agtgccttaa 
acttctggga ccctctcaac caagataagg 
ccgccaagca cctgggtctg aagcacgtgg 
tgacggatgg caagctggag gtgccgcact 
tctggtccat tggcatcccc atgaccagtg 
tcgcggcgtg gcggcccgtg aaagcctctg 
tgggagatgt accaatggat ggtatctctg 
tttttaattc tccagaggaa tttttaggca 
caatacagca atatgctgat gttttgtcca 
agattacccc ggaagctttc gagaagctgg 
tgtgtcgttt ctatgaaatg aagccagacc 
ccaaagtcaa aagcttcagc cagtttatct 
agaaaatcag ctgttcagat aggcctctgc 
ttcctcttta cggcacaaca ttcatgttga 
caccgaagga tttcctgcgg tcgcctcttc 
cggtaatttg attcacattt aacttgctag 
aaaatgagaa gcctcggaac ttggagcttc 
actgggattt ctcctgggtg agtaatttca 
tccagttttc tcaactgcat tgcaaaattc 
aaaaatgaac atctttgtag agaattttct 
agcattggaa atgctaaaat tcagttttgc 
ttcatgaagt catctattga gccaccattc 
catttatcca ttctgcaaac ttttcttgag 
tcttcattcc tatgtgtttt cttatcaaag 
ctgtggttgg gttcaagtca tgccagggcc 
caaaatccag gggatctgca gtggggagcg 
agggtaggga tgtggaaaga caaggtgaca 
tggcttagca ttttctacat catattgtaa 
gtgagtgact aacagtcatc tttatcccag 
gttgattgac taaaaaaaaa aaaaaaa 

<210> 154 
<211> 2148 
<212> DNA 

<213> Homo sapiens 



ttggagcaac aggagctcaa ggtggctctg 12 0 
ttgcagtgag agcagtgacc agggatgtga 18 0 
ttggagctga ggtggtcaaa ggtgacctga 24 0 
aaggtgtcta tggggccttc ttggtgacca 300 
aagtgtgtcg ggggaagctg gtggcagact 360 
tgtacagcgg cctggagaac gtcaagcgac 42 0 
ttgacagcaa gggcgaggtg gaggagtact 480 
tccgcgtggc ggcctacttt gaaaactttc 540 
atggagatta ctacaccttg gctgtaccga 600 
ttgctgatat tggagcagcc gtctctagca 660 
aggccgtggg gctcagtgca gaagcactaa 72 0 
aggctttggg gaaagaagtc cgagatgcaa 78 0 
gattccctgc agcaaaggaa atagccaata 840 
gagatgtcaa tctcacccac caactaaatc 900 
cagagaacca gggagccttc aagggcatgt 960 
accacacagc ctctttcctc tctgatcctt 1020 
cagaacatgc tggaatgcaa ttgtttgcaa 1080 
agtaggaagc actgcattgg tgataggaca 1140 
ttagtgataa gggtggtaca actgtttggt 12 00 
tctcctacca ctaatgggag ggcagattat 12 60 
agccctaatg ctgaaattcc cctaggcagc 1320 
ccagtgaact tt'taagtact tttaacttaa 1380 
ggggaacatg gtgttcaatg aacaagcaca 14 40 
ctcaagattg gaagtttatt ttctgactca 1500 
aattattcat ctattaattc cttgatcctt 1560 
caccagcacg ggtggccatt tgtggacttc 162 0 
tgatccactc tcgaaaggct cctttccagt 1680 
agggggccca tctcctcgtt tagctctagg 17 4 0 
ggggcaggaa gctggaggga aggcctgtga 1800 
gaaggaccca ataggacctt tctatatctc 18 60 
tcgtcttatt tgctagtttt cttccttact 1920 
tgcctggtac ataataagtg atcaataaat 1980 
2007 



<400> 154 

gaattcgtcg ctgctccagg gaaagttctg 
acatggccag caagaaagta attacagtgt 
tggccagggc aattttggag agcaaaaaat 
cttgaccaaa tgccctggag ctccagcgcc 
atgataaagc atcggtggac agtgccttaa 
cacctgggtc tgaagcacgt ggtgtacagc 
ggcaagctgg aggtgccgca ctttgacagc 
attggcatcc ccatgaccag tgtccgcgtg 
tggcggcccg tgaaagcctc tgatggagat 
gtaccaatgg atggtatctc tgttgctgat 
tctccagagg aatttttagg caaggccgtg 
caatatgctg atgttttgtc caaggctttg 
tgtgctatag atgaccagaa aacagtggaa 
tggtccttga gggaacatga ccatgtatag 
ctaattctgg aataaacacg acaaaccaga 
ctgcctctat ccttgattac cccggaagct 
gaaatagcca atatgtgtcg tttctatgaa 
caccaactaa atcccaaagt caaaagcttc 



ttactccact gactctctct tttcctgata 60 
ttggagcaac aggagctcaa ggtggctctg 12 0 
ttgcagtgag agcagtgacc agggatgtga 18 0 
ttggagctga ggtggtcaaa ggtgacctga 240 
aaggggaagc tggtggcaga ctccgccaag 300 
ggcctggaga acgtcaagcg actgacggat 360 
aagggcgagg tggaggagta cttctggtcc 42 0 
gcggcctact ttgaaaactt tctcgcggcg 480 
tactacacct tggctgtacc gatgggagat 54 0 
attggagcag ccgtctctag catttttaat 600 
gggctcagtg cagaagcact aacaatacag 660 
gggaaagaag tccgagatgc aaagactatc 72 0 
gaaggtttca tggaagacgt gggcttgagt 780 
acagaggagg catcaagaag gctggcctgg 840 
ggcagtacgg gaaggaggca aattctggct 900 
ttcgagaagc tgggattccc tgcagcaaag 960 
atgaagccag accgagatgt caatctcacc 1020 
agccatttta tctcagagaa ccagggagcc 108 0 
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ttcaagggca tgtagaaaat cagctgttca gataggcctc tgcaccacac agcctctttc 1140 
ctctctgatc cttttcctct ttacggcaca acattcatgt tgacagaaca tgctggaatg 1200 
caattgtttg caacaccgaa ggatttcctg cggtcgcctc ttcagtagga agcactgcat 12 60 
tggtgatagg acacggtaat ttgattcaca tttaacttgc tagttagtga taagggtggt 1320 
acaactgttt ggtaaaatga gaagcctcgg aacttggagc ttctctccta ccactaatgg 13 80 
gagggcagat tatactggga tttctcctgg gtgagtaatt tcaagcccta atgctgaaat 1440 
tcccctaggc agctccagtt ttctcaactg cattgcaaaa ttcccagtga acttttaagt 1500 
acttttaact taaaaaaatg aacatctttg tagagaattt tctggggaac atggtgttca 15 60 
atgaacaagc acaagcattg gaaatgctaa aattcagttt tgcctcaaga ttggaagttt 1620 
attttctgac tcattcatga agtcatctat tgagccacca ttcaattatt catctattaa 1680 
ttccttgatc cttcatttat ccattctgca aacttttctt gagcaccagc acgggtggcc 174 0 
atttgtggac ttctcttcat tcctatgtgt tttcttatca aagtgatcca ctctcgaaag 1800 
gctcctttcc agtctgtggt tgggttcaag tcatgccagg gccagggggc ccatctcctc 18 60 
gtttagctct aggcaaaatc caggggatct gcagtgggga gcgggggcag gaagctggag 1920 
ggaaggcctg tgaagggtag ggatgtggaa agacaaggtg acagaaggac ccaataggac 1980 
ctttctatat ctctggctta gcattttcta catcatattg taatcgtctt atttgctagt 2040 
tttcttcctt actgtgagtg actaacagtc atctttatcc cagtgcctgg tacataataa 2100 
gtgatcaata aatgttgatt gactaaatga aaaaaaaaaa aaaaaaaa 2148 

<210> 155 
<211> 153 
<212> PRT 

<213> Homo sapiens 
<400> 155 



Met 


Thr 


Ser Val 


Arg Val 


Ala 


Ala 


Tyr 


Phe 


Glu 


Asn 


Phe 


Leu Ala 


Ala 


1 






5 










10 








15 




Trp 


Arg 


Pro Val 


Lys 


Ala 


Ser 


Asp 


Gly 


Asp 


Tyr 


Tyr 


Thr 


Leu Ala 


Val 






20 










25 










30 




Pro 


Met 


Gly Asp 


Val 


Pro 


Met 


Asp 


Gly 


He 


Ser 


Val 


Ala 


Asp He 


Gly 






35 








40 










45 






Ala 


Ala 


Val Ser 


Ser 


lie 


Phe 


Asn 


Ser 


Pro 


Glu 


Glu 


Phe 


Leu Gly Lys 




50 








55 










60 








Ala 


Val 


Gly Leu 


Ser 


Ala 


Glu 


Ala 


Leu 


Thr 


lie 


Gin 


Gin 


Tyr Ala 


Asp 


65 








70 










75 








80 


Val 


Leu 


Ser Lys 


Ala 


Leu 


Gly 


Lys 


Glu 


Val 


Arg 


Asp 


Ala 


Lys He 


Thr 








85 










90 








95 




Pro 


Glu 


Ala Phe 


Glu 


Lys 


Leu 


Gly 


Phe 


Pro 


Ala 


Ala 


Lys 


Glu He 


Ala 






100 










105 










110 




Asn 


Met 


Cys Arg 


Phe 


Tyr 


Glu 


Met 


Lys 


Pro 


Asp 


Arg 


Asp 


Val Asn 


Leu 






115 








120 










125 






Thr 


His 


Gin Leu 


Asn 


Pro 


Lys 


Val 


Lys 


Ser 


Phe 


Ser 


Gin 


Phe He 


Ser 




130 








135 










140 








Glu 


Asn 


Gin Gly 


Ala 


Phe 


Lys 


Gly 


Met 














145 








150 





















<210> 156 
<211> 128 
<212> PRT 

<213> Homo sapiens 
<400> 156 

Met Thr Ser Val Arg Val Ala Ala 

1 5 
Trp Arg Pro Val Lys Ala Ser Asp 
20 

Pro Met Gly Asp Val Pro Met Asp 



Tyr Phe Glu Asn Phe Leu Ala Ala 

10 15 
Gly Asp Tyr Tyr Thr Leu Ala Val 
25 30 
Gly He Ser Val Ala Asp He Gly 
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35 40 45 



Ala 


Ala 


Val 


Ser Ser 


He 


Phe Asn 


Ser 


Pro 


Glu 


Glu Phe Leu 


Gly 


Lys 




50 








55 








60 






Ala 


Val 


Gly 


Leu Ser 


Ala 


Glu Ala 


Leu 


Thr 


He 


Gin Gin Tyr Ala 


Asp 


65 








70 








75 






80 


Val 


Leu 


Ser 


Lys Ala Leu Gly Lys Glu Val Arg 


Asp Ala Lys 


Thr 


He 








85 








90 






95 




Cys 


Ala 


He 


Asp Asp 


Gin 


Lys Thr 


Val 


Glu 


Glu 


Gly Phe Met 


Glu 


Asp 








100 






105 






110 






Val 


Gly Leu 


Ser Trp 


Ser 


Leu Arg 


Glu 


His 




His Val Ala 


Gly 


Ala 



115 120 125 



<210> 157 
<211> 424 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 320, 322 
<223> n = A,T,C or G 



<400> 157 

ctgcagcccg ggggatccac 
ggatacatta cagcagacat 
aattcagtca ccactgttat 
tattagattt ccttgtatgc 
agaggaagga gaaaactgca 
agcccagaaa cttctctgcn 
ccccagccat cgagtcagtt 
tgct 

<210> 158 
<211> 2099 
<212> DNA 
<213> Homo sapiens 



tagtccagtg tggtggaatt 
ggaaatataa ttttaaaaaa 
attaccttct ccaggaaccc 
aaagtttttg ttgaaagctg 
tcataacttt acagaattga 
gnatctggct tgtccatctg 
tgtgcccatg aataatacac 



cattggtctt 


tacaagactt 


60 


tttctctcca 


acctccttca 


120 


tccagtgggg 


aaggctgcga 


180 


tgctcagagg 


aggtgagagg 


240 


atctagagtc 


ttccccgaaa 


300 


gtctaaggtg 


gctgcttctt 


360 


gacctgctat 


ttcccatgac 


420 






424 



<400> 158 

ccgcggttaa aaggcgcagc aggtgggagc 
ccgacagccg gcggcgcccg agcccgacct 
ccgcgcagag cccgcgccag ggccgccggc 
aaggcacttc ctgtcggtga agaagacctg 
caaacggggc tgacctccct tcctggggag 
agaagatctg gctaaacaat ttctgtatgg 
ttcatgcatc tttaattcaa tttgaatatt 
attgacattc gtatcatcac tgtgcaccat 
aaggaggtct gaaaccctcg cagagggatc 
gcagtcgttg gaaacaggac tcagggataa 
actttcatcg ggggtgtcaa caaacactcc 
atctttattt tccgagtcat gatcctcgtg 
caagaggact tcgtctgcaa cacactgcaa 
tttttcccgg tgtcccacat ccggctgtgg 
gcgctgctgg tggccatgca tgtggcctac 
cgaggagaga agaggaatga tttcaaagac 
atagaggggt cgctgtggtg gacgtacacc 
gcagccttta tgtatgtgtt ttacttcctt 
aaatgtggga ttgacccctg ccccaacctt 



cggggccttc acccgaaacc cgacgagagc 60 
gcctgcccag ccggagcgaa gggcgccgcc 120 
cgcagagcag ttaaaacgtg caggcaccag 180 
tctccggtgt cacgggcatc ctgtgttttg 24 0 
caggaagggt cagggaagga aaagaagtac 30 0 
cgaaagaaaa attctaactt gtacgccctc 3 60 
ccaggcgaca tcctcactga ccgagcaaag 42 0 
tggcttctag gcactccagt ggggtaggag 480 
ttgccctcat tctttgggtc tgaaacactg 54 0 
accagcgcaa tggattgggg gacgctgcac 600 
accagcatcg ggaaggtgtg gatcacagtc 660 
gtggctgccc aggaagtgtg gggtgacgag 72 0 
ccgggatgca aaaatgtgtg ctatgaccac 7 80 
gccctccagc tgatcttcgt ctccacccca 84 0 
tacaggcacg aaaccactcg caagttcagg 900 
atagaggaca ttaaaaagca gaaggttcgg 960 
agcagcatct ttttccgaat catctttgaa 1020 
tacaatgggt accacctgcc ctgggtgttg 108 0 
gttgactgct ttatttctag gccaacagag 1140 
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aagaccgtgt ttaccatttt tatgatttct gcgtctgtga tttgcatgct gcttaacgtg 12 00 
gcagagttgt gctacctgct gctgaaagtg tgttttagga gatcaaagag agcacagacg 12 60 
caaaaaaatc accccaatca tgccctaaag gagagtaagc agaatgaaat gaatgagctg 132 0 
atttcagata gtggtcaaaa tgcaatcaca ggttcccaag ctaaacattt caaggtaaaa 13 8 0 
tgtagctgcg tcataaggag acttctgtct tctccagaag gcaataccaa cctgaaagtt 14 4 0 
ccttctgtag cctgaagagt ttgtaaatga ctttcataat aaatagacac ttgagttaac 1500 
tttttgtagg atacttgctc cattcataca caacgtaatc aaatatgtgg tccatctctg 1560 
aaaacaagag actgcttgac aaaggagcat tgcagtcact ttgacaggtt ccttttaagt 1620 
ggactctctg acaaagtggg tactttctga aaatttatat aactgttgtt gataaggaac 1680 
atttatccag gaattgatac gtttattagg aaaagatatt tttataggct tggatgtttt 17 40 
tagttctgac tttgaattta tataaagtat ttttataatg actggtcttc cttacctgga 1800 
aaaacatgcg atgttagttt tagaattaca ccacaagtat ctaaatttgg aacttacaaa 18 60 
gggtctatct tgtaaatatt gttttgcatt gtctgttggc aaatttgtga actgtcatga 1920 
tacgcttaag gtggaaagtg ttcattgcac aatatatttt tactgctttc tgaatgtaga 19 8 0 
cggaacagtg tggaagcaga aggctttttt aactcatccg tttgccaatc attgcaaaca 2040 
actgaaatgt ggatgtgatt gcctcaataa agctcgtccc cattgcttaa aaaaaaaaa 2099 

<210> 159 
<211> 291 
<212> PRT 

<213> Homo sapiens 



<400> 159 



Met 


Asp 


Trp 


Gly 


Thr 


Leu 


His 


Thr 


Phe 


He 


Gly 


Gly 


Val 


Asn 


Lys 


His 


1 








5 










10 










15 




Ser 


Thr 


Ser 


He 


Gly 


Lys 


Val 


Trp 


He 


Thr 


Val 


He 


Phe 


lie 


Phe 


Arg 








20 










25 










30 






Val 


Met 


He 


Leu 


Val 


Val 


Ala 


Ala 


Gin 


Glu 


Val 


Trp 


Gly 


Asp 


,Glu 


Gin 






35 










40 










45 








Glu 


Asp 


Phe 


Val 


Cys 


Asn 


Thr 


Leu 


Gin 


Pro 


Gly 


Cys 


Lys 


Asn 


Val 


Cys 




50 










55 










60 










Tyr 


Asp 


His 


Phe 


Phe 


Pro 


Val 


Ser 


His 


He 


Arg 


Leu 


Trp 


Ala 


Leu 


Gin 


65 










70 










75 










80 


Leu 


He 


Phe 


Val 


Ser 


Thr 


Pro 


Ala 


Leu 


Leu 


Val 


Ala 


Met 


His 


Val 


Ala 










85 










90 










95 




Tyr 


Tyr 


Arg 


His 


Glu 


Thr 


Thr 


Arg 


Lys 


Phe 


Arg 


Arg 


Gly 


Glu 


Lys 


Arg 








100 










105 










110 






Asn 


Asp 


Phe 


Lys 


Asp 


He 


Glu 


Asp 


He 


Lys 


Lys 


Gin 


Lys 


Val 


Arg 


He 






115 










120 










125 








Glu 


Gly 


Ser 


Leu 


Trp 


Trp 


Thr 


Tyr 


Thr 


Ser 


Ser 


He 


Phe 


Phe 


Arg 


He 




130 










135 










140 










He 


Phe 


Glu 


Ala 


Ala 


Phe 


Met 


Tyr 


Val 


Phe 


Tyr 


Phe 


Leu 


Tyr 


Asn 


Gly 


145 










150 










155 










160 


Tyr 


His 


Leu 


Pro 


Trp 


Val 


Leu 


Lys 


Cys 


Gly 


He 


Asp 


Pro 


Cys 


Pro 


Asn 










165 










170 










175 




Leu 


Val 


Asp 


Cys 


Phe 


He 


Ser 


Arg 


Pro 


Thr 


Glu 




Thr 


Val 


Phe 


Thr 








180 










185 










190 






He 


Phe 


Met 


He 


Ser 


Ala 


Ser 


Val 


He 


Cys 


Met 


Leu 


Leu 


Asn 


Val 


Ala 






195 










200 










205 








Glu 


Leu 


Cys 


Tyr 


Leu 


Leu 


Leu 


Lys 


Val 


Cys 


Phe 


Arg 


Arg 


Ser 


Lys 


Arg 




210 










215 










220 










Ala 


Gin 


Thr 


Gin 


Lys 


Asn 


His 


Pro 


Asn 


His 


Ala 


Leu 


Lys 


Glu 


Ser 


Lys 


225 










230 










235 










240 


Gin 


Asn 


Glu 


Met 


Asn 


Glu 


Leu 


He 


Ser 


Asp 


Ser 


Gly 


Gin 


Asn 


Ala 


He 










245 










250 










255 




Thr 


Gly 


Ser 


Gin 


Ala 


Lys 


His 


Phe 


Lys 


Val 


Lys 


Cys 


Ser 


Cys 


Val 


He 








260 










265 










270 






Arg 


Arg 


Leu 




Ser 


Ser 


Pro 


Glu 


Gly 


Asn 


Thr 


Asn 


Leu 


Lys 


Val 


Pro 
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275 280 
Ser Val Ala 
290 



<210> 160 
<211> 3951 
<212> DNA 
<213> Homo sapiens 

<400> 160 

tctgcatcca tattgaaaac ctgacacaat 
gaggcttctc tacaacatga cccaaaggag 
tgtgactctc ctggttgcct taagttcaga 
tcaagacaat gggtataatg gattgctcat 
gaacctcatc tcaaacatta aggaaatgat 
taccaagaga agagtatttt tcagaaatat 
taataataac agcaaaataa aacaagaatc 
ctggtatggg gcacatggag atgatccata 
gggaaaatac attcatttca cacctaattt 
cggatcacga ggccgagtgt ttgtccatga 
tgagtataac aatgacaaac ctttctacat 
gtgttcatct gacatcacag gcatttttgt 
ctgtattatt agtaagcttt ttaaagaagg 
tgcaactgca tcaataatgt tcatgcaaag 
aagtacccac aaccaagaag caccaaacct 
atgggatgta atcacagact ctgctgactt 
gcttccacct cctcccacat tctcgcttgt 
gctggatgtg tccagcaaga tggcagaggc 
agaattttat ttgatgcaga ttgttgaaat 
cagcaaagga gagatcagag cccagctaca 
gctggtttca tatctgccca ccactgtatc 
gcttaagaaa ggatttgagg tggttgaaaa 
gatattagtg accagcggag atgataagct 
cagtggttca acaattcact ccattgccct 
attatcacgt cttacaggag gtttaaagtt 
catgattgat gctttcagta gaatttcctc 
tcagcttgaa agtacaggtg aaaatgtcaa 
tgtggataat actgtgggca acgacactat 
tcctgagatt atattatttg atcctgatgg 
caatctaact tttcggacag ctagtctttg 
gacttacacc ctgaacaata cccatcattc 
tcgcgcctcc aactcagctg tgcccccagc 
cctccatttt cctcatcctg tgatgattta 
tcttaatgcc actgtcactg ccacagttga 
actccttgat gatggagcag gtgctgatgt 
ttttttctcc tttgctgcaa atggtagata 
cagcataagc accccagccc actctattcc 
cacagcaaac ggtaatattc agatgaatgc 
ggagcgaaag tggggcttta gccgagtcag 
tccagctggc ccccaccctg atgtgtttcc 
aaaagtagaa gaggaattga ccctatcttg 
ccaggctaca agctatgaaa taagaatgag 
taacaatgct attttagtaa atacatcaaa 
gatatttacg ttctcacccc aaatttccac 
aacacatgaa agccacagaa tttatgttgc 
gtctgctgta tctaacattg cccaggcgcc 
acctgccaga gattatctta tattgaaagg 



285 



gtatgcagca ggctcagtgt gagtgaactg 60 
cattgcaggt cctatttgca acctgaagtt 120 
actcccattc ctgggagctg gagtacagct 180 
tgcaattaat cctcaggtac ctgagaatca 240 
aactgaagct tcattttacc tatttaatgc 300 
aaagatttta atacctgcca catggaaagc 360 
atatgaaaag gcaaatgtca tagtgactga 420 
caccctacaa tacagagggt gtggaaaaga 480 
cctactgaat gataacttaa cagctggcta 540 
atgggcccac ctccgttggg gtgtgttcga 600 
aaatgggcaa aatcaaatta aagtgacaag 660 
gtgtgaaaaa ggtccttgcc cccaagaaaa 720 
atgcaccttt atctacaata gcacccaaaa 780 
tttatcttct gtggttgaat tttgtaatgc 840 
acagaaccag atgtgcagcc tcagaagtgc 900 
tcaccacagc tttcccatga acgggactga 960 
agaggctggt gacaaagtgg tctgtttagt 1020 
tgacagactc cttcaactac aacaagccgc 1080 
tcataccttc gtgggcattg ccagtttcga 1140 
ccaaattaac agcaatgatg atcgaaagtt 1200 
agctaaaaca gacatcagca tttgttcagg 12 60 
actgaatgga aaagcttatg gctctgtgat 1320 
tcttggcaat tgcttaccca ctgtgctcag 1380 
gggttcatct gcagccccaa atctggagga 14 40 
ctttgttcca gatatatcaa actccaatag 1500 
tggaactgga gacattttcc agcaacatat 1560 
acctcaccat caattgaaaa acacagtgac 1620 
gtttctagtt acgtggcagg ccagtggtcc 1680 
acgaaaatac tacacaaata attttatcac 17 40 
gattccagga acagctaagc ctgggcactg 1800 
tctgcaagcc ctgaaagtga cagtgacctc 18 60 
cactgtggaa gcctttgtgg aaagagacag 1920 
tgccaatgtg aaacagggat tttatcccat 1980 
gccagagact ggagatcctg ttacgctgag 2 04 0 
tataaaaaat gatggaattt actcgaggta 2100 
tagcttgaaa gtgcatgtca atcactctcc 2160 
agggagtcat gctatgtatg taccaggtta 2220 
tccaaggaaa tcagtaggca gaaatgagga 22 8 0 
ctcaggaggc tccttttcag tgctgggagt 2340 
accatgcaaa attattgacc tggaagctgt 2 4 00 
gacagcacct ggagaagact ttgatcaggg 24 60 
taaaagtcta cagaatatcc aagatgactt 2520 
gcgaaatcct cagcaagctg gcatcaggga 2580 
gaatggacct gaacatcagc caaatggaga 2 640 
aatacgagca atggatagga actccttaca 2700 
tctgtttatt ccccccaatt ctgatcctgt 27 60 
agttttaaca gcaatgggtt tgataggaat 2820 
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catttgcctt attatagttg tgacacatca tactttaagc aggaaaaaga gagcagacaa 28 8 0 

gaaagagaat ggaacaaaat tattataaat aaatatccaa agtgtcttcc ttcttagata 2940 

taagacccat ggccttcgac tacaaaaaca tactaacaaa gtcaaattaa catcaaaact 3000 

gtattaaaat gcattgagtt tttgtacaat acagataaga tttttacatg gtagatcaac 3060 

aaattctttt tgggggtaga ttagaaaacc cttacacttt ggctatgaac aaataataaa 312 0 

aattattctt taaagtaatg tctttaaagg caaagggaag ggtaaagtcg gaccagtgtc 3180 

aaggaaagtt tgttttattg aggtggaaaa atagccccaa gcagagaaaa ggagggtagg 3240 

tctgcattat aactgtctgt gtgaagcaat catttagtta ctttgattaa tttttctttt 3300 

ctccttatct gtgcagaaca ggttgcttgt ttacaactga agatcatgct atatttcata 33 60 

tatgaagccc ctaatgcaaa gctctttacc tcttgctatt ttgttatata tattacagat 3420 

gaaatctcac tgctaatgct cagagatctt ttttcactgt aagaggtaac ctttaacaat 3480 

atgggtatta cctttgtctc ttcataccgg ttttatgaca aaggtctatt gaatttattt 3540 

gtttgtaagt ttctactccc atcaaagcag ctttttaagt tattgccttg gttattatgg 3600 

atgatagtta tagcccttat aatgccttaa ctaaggaaga aaagatgtta ttctgagttt 3660 

gttttaatac atatatgaac atatagtttt attcaattaa accaaagaag aggtcagcag 3720 

ggagatacta acctttggaa atgattagct ggctctgttt tttggttaaa taagagtctt 3780 

taatcctttc tccatcaaga gttacttacc aagggcaggg gaagggggat atagaggtcc 384 0 

caaggaaata aaaatcatct ttcatcttta attttactcc ttcctcttat ttttttaaaa 3900 

gattatcgaa caataaaatc atttgccttt ttaattaaaa acataaaaaa a 3951 

<210> 161 
<211> 943 
<212> PRT 

<213> Homo sapiens 



<400> 161 



Met 


Thr 


Gin 


Arg 


Ser 


He 


Ala 


Gly 


Pro 


He 


Cys 


Asn 


Leu 


Lys 


Phe 


Val 


1 








5 










10 










15 




Thr 


Leu 


Leu 


Val 


Ala 


Leu 


Ser 


Ser 


Glu 


Leu 


Pro 


Phe 


Leu 


Gly 


Ala Gly 








20 










25 










30 






Val 


Gin 


Leu 
35 


Gin 


Asp 


Asn 


Gly 


Tyr 
40 


Asn 


Gly 


Leu 


Leu 


He 
45 


Ala 


He 


Asn 


Pro 


Gin 


Val 


Pro 


Glu 


Asn 


Gin 


Asn 


Leu 


He 


Ser 


Asn 


He 


Lys 


Glu 


Met 




50 










55 










60 








He 


Thr 


Glu 


Ala 


Ser 


Phe 


Tyr 


Leu 


Phe 


Asn 


Ala 


Thr 


Lys 


Arg 


Arg 


Val 


65 










70 










75 










80 


Phe 


Phe 


Arg 


Asn 


lie 
85 


Lys 


He 


Leu 


He 


Pro 
90 


Ala 


Thr 


Trp 


Lys 


Ala 
95 


Asn 


Asn 


Asn 


Ser 


Lys 
100 


He 


Lys 


Gin 


Glu 


Ser 
105 


Tyr 


Glu 


Lys 


Ala 


Asn 
110 


Val 


He 


Val 


Thr 


Asp 
115 


Trp 


Tyr 


Gly 


Ala 


His 
120 


Gly 


Asp 


Asp 


Pro 


Tyr 
125 


Thr 


Leu 


Gin 


Tyr 


Arg 
130 


Gly 


Cys 


Gly 


Lys 


Glu 
135 


Gly 


Lys 


Tyr 


He 


His 
140 


Phe 


Thr 


Pro 


Asn 


Phe 


Leu 


Leu 


Asn 


Asp 


Asn 




Thr 


Ala 


Gly 


Tyr 


Gly 


Ser 


Arg 


Gly 


Arg 


145 










150 










155 










160 


Val 


Phe 


Val 


His 


Glu 
165 


Trp 


Ala 


His 


Leu 


Arg 
170 


Trp 


Gly 


Val 


Phe 


Asp 
175 


Glu 


Tyr 


Asn 


Asn 


Asp 
180 


Lys 


Pro 


Phe 


Tyr 


He 
185 


Asn 


Gly 


Gin 


Asn 


Gin 
190 


He 


Lys 


Val 


Thr 


Arg 
195 


Cys 


Ser 


Ser 


Asp 


He 

200 


Thr 


Gly 


He 


Phe 


Val 
205. 


Cys 


Glu 


Lys 


Gly 


Pro 
210 


Cys 


Pro 


Gin 


Glu 


Asn 
215 


Cys 


He 


He 


Ser 


Lys 
220 


Leu 


Phe 


Lys 


Glu 


Gly 


Cys 


Thr 


Phe 


He 


Tyr 


Asn 


Ser 


Thr 


Gin 


Asn 


Ala 


Thr 


Ala 


Ser 


He 


225 










230 










235 










240 


Met 


Phe 


Met 


Gin 


Ser 
245 


Leu 


Ser 


Ser 


Val 


Val 
250 


Glu 


Phe 


Cys 


Asn 


Ala 
255 


Ser 



WO 02/47534 



78 



PCT7US01/47576 



Thr 


His 


Asn 


Gin 


Glu 


Ala 


Pro 


Asn 


Leu 


Gin 


Asn 


Gin 


Met 


Cys 


Ser 


Leu 








260 










265 










270 






Arg 


Ser 


Ala 


Trp 


Asp 


Val 


He 


Thr 




Ser 


Ala 


Asp 


Phe 


His 


His 


Ser 






275 










280 










285 








Phe 


Pro 


Met 


Asn 


Gly 


Thr 


Glu 


Leu 


Pro 


Pro 


Pro 


Pro 


Thr 


Phe 


Ser 


Leu 




290 




Gly 






295 










300 










Val 


Glu 


Ala 


Asp 


Lys 


Val 


Val 


Cys 


Leu 


Val 


Leu 


Asp 


Val 


Ser 


Ser 


305 










310 










315 










320 


Lys 


Met 


Ala 


Glu 


Ala 


Asp 


Arg 


Leu 


Leu 


Gin 


Leu 


Gin 


Gin 


Ala 


Ala 


Glu 










325 










330 










335 




Phe 


Tyr 


Leu 


Met 


Gin 


lie 


Val 


Glu 


He 


His 


Thr 


Phe 


Val 


Gly 


He 


Ala 








.340 










345 










350 






Ser 


Phe 


Asp 


Ser 


Lys 


Gly 


Glu 


He 


Arg 


Ala 


Gin 


Leu 


His 


Gin 


He 


Asn 






355 










360 










365 








Ser 


Asn 


Asp 


Asp 


Arg 


Lys 


Leu 


Leu 


Val 


Ser 


Tyr 


Leu 


Pro 


Thr 


Thr 


Val 




370 










375 










380 










Ser 


Ala 


Lys 


Thr 


Asp 


He 


Ser 


He 


Cys 


Ser 


Gly 


Leu 


Lys 


Lys 


Gly 


Phe 


385 










390 










395 










400 


Glu 


Val 


Val 


Glu 


Lys 


Leu 


Asn 


Gly 


Lys 


Ala 


Tyr 


Gly 


Ser 


Val 


Met 


He 










405 










410 










415 




Leu 


Val 


Thr 


Ser 


Gly 


Asp 


Asp 


Lys 


Leu 


Leu 


Gly 


Asn 


Cys 


Leu 


Pro 


Thr 








420 










425 










430 






Val 


Leu 


Ser 


Ser 


Gly 


Ser 


Thr 


He 


His 


Ser 


He 


Ala 


Leu 


Gly 


Ser 


Ser 






435 










440 










445 








Ala 


Ala 


Pro 


Asn 


Leu 


Glu 


Glu 


Leu 


Ser 


Arg 


Leu 


Thr 


Gly 


Gly 


Leu 


Lys 




450 










455 










460 










Phe 


Phe 


Val 


Pro 


Asp 


He 


Ser 


Asn 


Ser 


Asn 


Ser 


Met 


He 


Asp 


Ala 


Phe 


465 










470 










475 










480 


Ser 


Arg 


lie 


Ser 


Ser 


Gly 


Thr 


Gly 


Asp 


He 


Phe 


Gin 


Gin 


His 


He 


Gin 










485 










490 










495 




Leu 


Glu 


Ser 


Thr 


Gly 


Glu 


Asn 


Val 


Lys 


Pro 


His 


His 


Gin 


Leu 


Lys 


Asn 








500 










505 










510 






Thr 


Val 


Thr 


Val 


Asp 


Asn 


Thr 


Val 


Gly 


Asn 


Asp 


Thr 


Met 


Phe 


Leu 


Val 






515 










520 










525 








Thr 


Trp 


Gin 


Ala 


Ser 


Gly 


Pro 


Pro 


Glu 


He 


He 


Leu 


Phe 


Asp 


Pro 


Asp 




530 










535 










540 










Gly 


Arg 


Lys 


Tyr 


Tyr 


Thr 


Asn 


Asn 


Phe 


He 


Thr 


Asn 


Leu 


Thr 


Phe 


Arg 


545 










550 










555 










560 


Thr 


Ala 


Ser 


Leu 


Trp 


He 


Pro 


Gly 


Thr 


Ala 


Lys 


Pro 


Gly 


His 


Trp 


Thr 










565 










570 










575 




Tyr 


Thr 


Leu 


Asn 


Asn 


Thr 


His 


His 


Ser 


Leu 


Gin 


Ala 


Leu 


Lys 


Val 


Thr 








580 










585 










590 






Val 


Thr 


Ser 


Arg 


Ala 


Ser 


Asn 


Ser 


Ala 


Val 


Pro 


Pro 


Ala 


Thr 


Val 


Glu 






595 










600 










605 








Ala 


Phe 


Val 


Glu 


Arg 


Asp 


Ser 


Leu 


His 


Phe 


Pro 


His 


Pro 


Val 


Met 


He 




610 










615 










620 










Tyr 


Ala 


Asn 


Val 


Lys 


Gin 


Gly 


Phe 


Tyr 


Pro 


He 


Leu 


Asn 


Ala 


Thr 


Val 


625 










630 










635 










640 


Thr 


Ala 


Thr 


Val 


Glu 


Pro 


Glu 


Thr 


Gly 


Asp 


Pro 


Val 


Thr 


Leu 


Arg 


Leu 










645 










650 










655 




Leu 


Asp 


Asp 


Gly 


Ala 


Gly 


Ala 


Asp 


Val 


He 


Lys 


Asn 


Asp 


Gly 


He 


Tyr 








660 










665 










670 






Ser 


Arg 


Tyr 


Phe 


Phe 


Ser 


Phe 


Ala 


Ala 


Asn 


Gly 


Arg 


Tyr 


Ser 


Leu 


Lys 






675 










680 










685 








Val 


His 


Val 


Asn 


His 


Ser 


Pro 


Ser 


He 


Ser 


Thr 


Pro 


Ala 


His 


Ser 


He 




690 










695 










700 










Pro 


Gly 


Ser 


His 


Ala 


Met 


Tyr 


Val 


Pro 


Gly 


Tyr 


Thr 


Ala 


Asn 


Gly 


Asn 


705 










710 










715 










720 
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He 


Gin 


Met 


Asn 


Ala 


Pro Arg 


Lys 










725 








Arg 


Lys 


Trp 


Gly 


Phe 


Ser Arg 


Val 








740 










Leu 


Gly 


Val 


Pro 


Ala 


Gly 


Pro 


His 






755 










760 


He 


He 


Asp 


Leu 


Glu 


Ala 


Val 


Lys 




770 










775 




Trp 


Thr 


Ala 


Pro 


Gly 


Glu 


Asp 


Phe 


785 










790 






Glu 


He 


Arg Met 


Ser 


Lys 


Ser 


Leu 










805 








Asn 


Ala 


He 


Leu 


Val 


Asn 


Thr 


Ser 








820 










He 


Arg 


Glu 


He 


Phe 


Thr 


Phe 


Ser 






835 










840 


Glu 


His 


Gin 


Pro 


Asn 


Gly Glu 


Thr 




850 










855 




Ala 


He 


Arg Ala Met 


Asp 


Arg 


Asn 


865 










870 






He 


Ala 


Gin 


Ala 


Pro 


Leu 


Phe 


He 










885 








Ala 


Arg 


Asp 


Tyr Leu 


He 


Leu 


Lys 








900 










He 


Gly 


He 


He 


Cys 


Leu 


He 


He 






915 










920 


Arg 


Lys 


Lys Arg Ala 


Asp 


Lys 


Lys 




930 










935 





Ser 


Val 


Gly 


Arg 


Asn 


Glu 


Glu 


Glu 




730 










735 




Ser 


Ser 


Gly 


Gly 


Ser 


Phe 


Ser 


Val 


745 










750 






Pro 


Asp 


Val 


Phe 


Pro 


Pro 


Cys 


Lys 










765 








Val 


Glu 


Glu 


Glu 


Leu 


Thr 


Leu 


Ser 








780 










Asp 


Gin 




Gin 


Ala 


Thr 


Ser 








795 










800 


Gin 


Asn 


He 


Gin 


Asp 


Asp 


Phe 


Asn 




810 










815 




Lys 


Arg 


Asn 


Pro 


Gin 


Gin 


Ala 


Gly 


825 










830 






Pro 


Gin 


He 


Ser 


Thr 


Asn 


Gly 


Pro 


His 


Glu 


Ser 


His 


845 
Arg 


He 


Tyr 


Val 








860 










Ser 


Leu 


Gin 


Ser 


Ala 


Val 


Ser 


Asn 






875 










880 


Pro 


Pro 


Asn 


Ser 


Asp 


Pro 


Val 


Pro 




890 










895 




Gly 


Val 


Leu 


Thr 


Ala 


Met 


Gly 


Leu 


905 










910 






Val 


Val 


Thr 


His 


His 


Thr 


Leu 


Ser 










925 








Glu 


Asn 


Gly 


Thr 


Lys 


Leu 


Leu 





940 



<210> 162 
<211> 498 
<212> DNA 
<213> Homo sapiens 



<400> 162 

tggagaacca cgtggacagc accatgaaca 
agcccctcaa gtcgggtatg aaggagctgg 
accggcagat gggcaagggt ggcaagcatc 
gaccaccccc tgccaggact ccctgccaac 
ccaccatgcg ccttccggat gagcggggcc 
ccaactgtga caagcatggc ctgtacaacc 
cagcgtgggg agtgctggtg tgtgaacccc 
accatccggg gggaccccga gtgtcatctc 
gtgcacaccc cagcggat 

<210> 163 
<211> 1128 
<212> DNA 
<213> Homo sapiens 



tgttgggcgg gggaggcagt gctggccgga 60 

ccgtgttccg ggagaaggtc actgagcagc 120 

accttggcct ggaggagccc aagaagctgc 180 

aggaactgga ccaggtcctg gagcggatct 240 

ctctggagca cctctactcc ctgcacatcc 3 00 

tcaaacagtg gcaagatgtc tctgaacggg 360 

aacaccggga agctgatcca gggagccccc 420 

ttctacaatg agcagcagga ggctcgcggg 480 
498 



<400> 163 

gccacctggc cctcctgatc gacgacacac 
aatcaacttt ccggaagcaa ccagcccacc 
tgcagcggag actggttcag cagtggagcg 
cctcctgcgg gcgctcggtg gagggtctca 
atcagctcct ccatgacaag gggaagtcca 
accatctgat cgcagaaatc cacacagctg 



gcacttgaaa cttgttctca gggtgtgtgg 60 
agaggaggtc ccgagcgcga gcggagacga 12 0 
tcgcggtgtt cctgctgagc tacgcggtgc 18 0 
gccgccgcct caaaagagct gtgtctgaac 240 
tccaagattt acggcgacga ttcttccttc 300 
aaatcagagc tacctcggag gtgtccccta 360 
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actccaagcc ctctcccaac acaaagaacc 
gcagatacct aactcaggaa actaacaagg 
cacctgggaa gaaaaagaaa ggcaagcccg 
ggcgaactcg ctctgcctgg ttagactctg 
acctgtctga cacctccaca acgtcgctgg 
agcagagacc ttccaaggac atattgcagg 
tagaaatatt tattgtctgt aaatactgta 
tgctctatga aactgcacat tggtcattgt 
aattattatt atcacattta ccataattta 
tgtatcttgg tgctgctgaa tttctatatt 
tatcaagtat gttgataaat gacacaatga 
aatgcctaaa tataattatc caaattgatt 
ttaaatttgt aaagaatgtc taataaaata 

<210> 164 

<211> 1310 

<212> DNA 

<213> Homo sapiens 



accccgtccg atttgggtct gatgatgagg 42 0 
tggagacgta caaagagcag ccgctcaaga 48 0 
ggaaacgcaa ggagcaggaa aagaaaaaac 54 0 
gagtgactgg gagtgggcta gaaggggacc 600 
agctcgattc acggaggcat tgaaattttc 660 
attctgtaat agtgaacata tggaaagtat 72 0 
aatgcattgg aataaaactg tctcccccat 780 
gaatattttt ttttttgcca aggctaatcc 84 0 
ttttgtccat tgatgtattt attttgtaaa 900 
ttttgtaaca taatgcactt tagatataca 960 
agtgtctcta ttttgtggtt gattttaatg 1020 
ttcctttgtg catgtaaaaa taacagtatt 1080 
taatctaatt acatcatg 1128 



<400> 164 

gggcctggtt cgcaaagaag ctgacttcag agggggaaac tttcttcttt taggaggcgg 60 

ttagccctgt tccacgaacc caggagaact gctggccaga ttaattagac attgctatgg 12 0 

gagacgtgta aacacactac ttatcattga tgcatatata aaaccatttt attttcgcta 18 0 

ttatttcaga ggaagcgcct ctgatttgtt tcttttttcc ctttttgctc tttctggctg 240 

tgtggtttgg agaaagcaca gttggagtag ccggttgcta aataagtccc gagcgcgagc 300 

ggagacgatg cagcggagac tggttcagca gtggagcgtc gcggtgttcc tgctgagcta 360 

cgcggtgccc tcctgcgggc gctcggtgga gggtctcagc cgccgcctca aaagagctgt 420 

gtctgaacat cagctcctcc atgacaaggg gaagtccatc caagatttac ggcgacgatt 48 0 

cttccttcac catctgatcg cagaaatcca cacagctgaa atcagagcta cctcggaggt 54 0 

gtcccctaac tccaagccct ctcccaacac aaagaaccac cccgtccgat ttgggtctga 600 

tgatgagggc agatacctaa ctcaggaaac taacaaggtg gagacgtaca aagagcagcc 660 

gctcaagaca cctgggaaga aaaagaaagg caagcccggg aaacgcaagg agcaggaaaa 72 0 

gaaaaaacgg cgaactcgct ctgcctggtt agactctgga gtgactggga gtgggctaga 780 

aggggaccac ctgtctgaca cctccacaac gtcgctggag ctcgattcac ggaggcattg 840 

aaattttcag cagagacctt ccaaggacat attgcaggat tctgtaatag tgaacatatg 900 

gaaagtatta gaaatattta ttgtctgtaa atactgtaaa tgcattggaa taaaactgtc 960 

tcccccattg ctctatgaaa ctgcacattg gtcattgtga atattttttt ttttgccaag 1020 

gctaatccaa ttattattat cacatttacc ataatttatt ttgtccattg atgtatttat 1080 

tttgtaaatg tatcttggtg ctgctgaatt tctatatttt ttgtaacata atgcacttta 1140 

gatatacata tcaagtatgt tgataaatga cacaatgaag tgtctctatt ttgtggttga 12 00 

ttttaatgaa tgcctaaata taattatcca aattgatttt cctttgtgcc cgtaaaaata 12 60 

acagtatttt aaatttgtaa agaatgtcta ataaaatata atctaattac 1310 

<210> 165 
<211> 177 
<212> PRT 

<213> Homo sapiens 
<400> 165 

Met Gin Arg Arg Leu Val Gin Gin Trp Ser Val Ala Val Phe Leu Leu 

15 10 15 

Ser Tyr Ala Val Pro Ser Cys Gly Arg Ser Val Glu Gly Leu Ser Arg 

20 25 30 

Arg Leu Lys Arg Ala Val Ser Glu His Gin Leu Leu His Asp Lys Gly 

35 40 45 

Lys Ser lie Gin Asp Leu Arg Arg Arg Phe Phe Leu His His Leu He 

50 55 60 

Ala Glu He His Thr Ala Glu He Arg Ala Thr Ser Glu Val Ser Pro 
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65 










70 




Asn 


Ser 


Lys 


Pro 


Ser 


Pro 


Asn Thr 










85 






Ser 


Asp 


Asp 


Glu 


Gly 


Arg 


Tyr Leu 








100 








Thr 


Tyr 


Lys 


Glu 


Gin 


Pro 


Leu Lys 






115 








120 


Lys 


Pro 


Gly 




Arg 


Lys 


Glu Gin 




130 










135 


Ser 


Ala 


Trp 


Leu 


Asp 


Ser 


Gly Val 


145 










150 




His 


Leu 


Ser 


Asp 


Thr 


Ser 


Thr Thr 










165 






His 



















75 




80 


Lys 


Asn 


His 


Pro Val Arg 


Phe Gly 




90 






95 


Thr 


Gin 


Glu 


Thr Asn Lys 


Val Glu 


105 






110 




Thr 


Pro 


Gly 


Lys Lys Lys 


Lys Gly 








125 




Glu 


Lys 


Lys 


Lys Arg Arg 


Thr Arg 








140 




Thr 


Gly 


Ser 


Gly Leu Glu 


Gly Asp 






155 




160 


Ser 


Leu 


Glu 


Leu Asp Ser Arg Arg 




170 






175 



<210> 166 
<211> 177 
<212> PRT 

<213> Homo sapiens 



<400> 166 



Met 


Gin 


Arg 


Arg Leu 


Val Gin Gin Trp 


Ser 


Val 


Ala Val Phe 


Leu Leu 


1 






5 




10 






15 


Ser 


Tyr 


Ala 


Val Pro 


Ser Cys Gly Arg 


Ser 


Val 


Glu Gly Leu 


Ser Arg 








20 


25 






30 




Arg 


Leu 


Lys 


Arg Ala 


Val Ser Glu His 


Gin 


Leu 


Leu His Asp 


Lys Gly 






35 




40 






45 




Lys 


Ser 


He 


Gin Asp 


Leu Arg Arg Arg 


Phe 


Phe 


Leu His His 


Leu He 




50 






55 






60 




Ala 


Glu 


He 


His Thr 


Ala Glu He Arg 


Ala 


Thr 


Ser Glu Val 


Ser Pro 


65 








70 




75 




80 


Asn 


Ser 


Lys 


Pro Ser 


Pro Asn Thr Lys 


Asn 


His 


Pro Val Arg 


Phe Gly 








85 




90 






95 


Ser 


Asp 


Asp 


Glu Gly 


Arg Tyr Leu Thr 


Gin 


Glu 


Thr Asn Lys 


Val Glu 








100 


105 






110 




Thr 


Tyr 


Lys 


Glu Gin 


Pro Leu Lys Thr 


Pro 


Gly 


Lys Lys Lys 


Lys Gly 






115 




120 






125 




Lys 


Pro 


Gly 


Lys Arg 


Lys Glu Gin Glu 


Lys 


Lys 


Lys Arg Arg 


Thr Arg 




130 






135' 






140 




Ser 


Ala 


Trp 


Leu Asp 


Ser Gly Val Thr 


Gly 


Ser 


Gly Leu Glu Gly Asp 


145 








150 




155 




160 


His 


Leu 


Ser 


Asp Thr 


Ser Thr Thr Ser 


Leu 


Glu 


Leu Asp Ser 


Arg Arg 



165 170 ~ 175 



His 



<210> 167 
<211> 3362 
<212> DNA 
<213> Homo sapiens 

<400> 167 

cacaatgtat gcagcaggct cagtgtgagt gaactggagg cttctctaca acatgaccca 60 
aaggagcatt gcaggtccta tttgcaacct gaagtttgtg actctcctgg ttgccttaag 12 0 
ttcagaactc ccattcctgg gagctggagt acagcttcaa gacaatgggt ataatggatt 18 0 
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gctcattgca attaatcctc aggtacctga 
aatgataact gaagcttcat tttacctatt 
aaatataaag attttaatac ctgccacatg 
agaatcatat gaaaaggcaa atgtcatagt 
tccatacacc ctacaataca gagggtgtgg 
taatttccta ctgaatgata acttaacagc 
ccatgaatgg gcccacctcc gttggggtgt 
ctacataaat gggcaaaatc aaattaaagt 
ttttgtgtgt gaaaaaggtc cttgccccca 
agaaggatgc acctttatct acaatagcac 
gcaaagttta tcttctgtgg ttgaattttg 
aaacctacag aaccagatgt gcagcctcag 
tgactttcac cacagctttc ccatgaacgg 
gcttgtagag gctggtgaca aagtggtctg 
agaggctgac agactccttc aactacaaca 
tgaaattcat accttcgtgg gcattgccag 
gctacaccaa attaacagca atgatgatcg 
tgtatcagct aaaacagaca tcagcatttg 
tgaaaaactg aatggaaaag cttatggctc 
taagcttctt ggcaattgct tacccactgt 
tgccctgggt tcatctgcag ccccaaatct 
aaagttcttt gttccagata tatcaaactc 
ttcctctgga actggagaca ttttccagca 
tgtcaaacct caccatcaat tgaaaaacac 
cactatgttt ctagttacgt ggcaggccag 
tgatggacga aaatactaca caaataattt 
tctttggatt ccaggaacag ctaagcctgg 
ccatgcaaaa ttattgacct ggaagctgta 
acagcacctg gagaagactt tgatcagggc 
aaaagtctac agaatatcca agatgacttt 
cgaaatcctc agcaagctgg catcagggag 
aatggacctg aacatcagcc aaatggagaa 
atacgagcaa tggataggaa ctccttacag 
ctgtttattc cccccaattc tgatcctgta 
gttttaacag caatgggttt gataggaatc 
actttaagca ggaaaaagag agcagacaag 
aatatccaaa gtgtcttcct tcttagatat 
actaacaaag tcaaattaac atcaaaactg 
cagataagat ttttacatgg tagatcaaca 
ttacactttg gctatgaaca aataataaaa 
aaagggaagg gtaaagtcgg accagtgtca 
tagccccaag cagagaaaag gagggtaggt 
atttagttac tttgattaat ttttcttttc 
tacaactgaa gatcatgcta tatttcatat 
cttgctattt tgttatatat attacagatg 
tttcactgta agaggtaacc tttaacaata 
tttatgacaa aggtctattg aatttatttg 
tttctaagtt attgccttgg ttattatgga 
taaggaagaa aagatgttat tctgagtttg 
ttcaattaaa ccaaagaaga ggtcagcagg 
gctctgtttt ttggttaaat aagagtcttt 
agggcagggg aagggggata tagaggtcac 
ttttactcct tcctcttatt tttttaaaag 
tt 



gaatcagaac ctcatctcaa acattaagga 24 0 
taatgctacc aagagaagag tatttttcag 300 
gaaagctaat aataacagca aaataaaaca 3 60 
gactgactgg tatggggcac atggagatga 42 0 
aaaagaggga aaatacattc atttcacacc 48 0 
tggctacgga tcacgaggcc gagtgtttgt 540 
gttcgatgag tataacaatg acaaaccttt 600 
gacaaggtgt tcatctgaca tcacaggcat 660 
agaaaactgt attattagta agctttttaa 720 
ccaaaatgca actgcatcaa taatgttcat 780 
taatgcaagt acccacaacc aagaagcacc 84 0 
aagtgcatgg gatgtaatca cagactctgc 900 
gactgagctt ccacctcctc ccacattctc 960 
tttagtgctg gatgtgtcca gcaagatggc 102 0 
agccgcagaa ttttatttga tgcagattgt 1080 
tttcgacagc aaaggagaga tcagagccca 1140 
aaagttgctg gtttcatatc tgcccaccac 1200 
ttcagggctt aagaaaggat ttgaggtggt 1260 
tgtgatgata ttagtgacca gcggagatga 132 0 
gctcagcagt ggttcaacaa ttcactccat 1380 
ggaggaatta tcacgtctta caggaggttt 14 4 0 
caatagcatg attgatgctt tcagtagaat 1500 
acatattcag cttgaaagta caggtgaaaa 1560 
agtgactgtg gataatactg tgggcaacga 1620 
tggtcctcct gagattatat tatttgatcc 1680 
tatcaccaat ctaacttttc ggacagctag 1740 
gcactggact tacaccctga tgtgtttcca 1800 
aaagtagaag aggaattgac cctatcttgg 1860 
caggctacaa gctatgaaat aagaatgagt 1920 
aacaatgcta ttttagtaaa tacatcaaag 1980 
atatttacgt tctcacccca aatttccacg 2040 
acacatgaaa gccacagaat ttatgttgca 2100 
tctgctgtat ctaacattgc ccaggcgcct 2160 
cctgccagag attatcttat attgaaagga 2220 
atttgcctta ttatagttgt gacacatcat 2280 
aaagagaatg gaacaaaatt attataaata 2340 
aagacccatg gccttcgact acaaaaacat 2400 
tattaaaatg cattgagttt ttgtacaata 24 60 
aattcttttt gggggtagat tagaaaaccc 2520 
attattcttt aaagtaatgt ctttaaaggc 2580 
aggaaagttt gttttattga ggtggaaaaa 2640 
ctgcattata actgtctgtg tgaagcaatc 2700 
tccttatctg tgcagaacag gttgcttgtt 27 60 
atgaagcccc taatgcaaag ctctttacct 2820 
aaatctcact gctaatgctc agagatcttt 2880 
tgggtattac ctttgtctct tcataccggt 2940 
tttgtaagtt tctactccca tcaaagcagc 3000 
tgatagttat agcccttata atgccttaac 3060 
ttttaataca tatatgaaca tatagtttta 3120 
gagatactaa cctttggaaa tgattagctg 3180 
aatcctttct ccatcaagag ttacttacca 3240 
aaggaaataa aaatcatctt tcatctttaa 3300 
attatcgaac aataaaatca tttgcctttt 3360 
3362 



<210> 168 
<211> 2784 
<212> DNA 
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<213> Homo sapiens 
<400> 168 

tctgcatcca tattgaaaac ctgacacaat gtatgcagca ggctcagtgt gagtgaactg 60 
gaggcttctc tacaacatga cccaaaggag cattgcaggt cctatttgca acctgaagtt 12 0 
tgtgactctc ctggttgcct taagttcaga actcccattc ctgggagctg gagtacagct 18 0 
tcaagacaat gggtataatg gattgctcat tgcaattaat cctcaggtac ctgagaatca 24 0 
gaacctcatc tcaaacatta aggaaatgat aactgaagct tcattttacc tatttaatgc 300 
taccaagaga agagtatttt tcagaaatat aaagatttta atacctgcca catggaaagc 3 60 
taataataac agcaaaataa aacaagaatc atatgaaaag gcaaatgtca tagtgactga 42 0 
ctggtatggg gcacatggag atgatccata caccctacaa tacagagggt gtggaaaaga 480 
gggaaaatac attcatttca cacctaattt cctactgaat gataacttaa cagctggcta 54 0 
cggatoacga ggccgagtgt ttgtccatga atgggcccac ctccgttggg gtgtgttcga 600 
tgagtataac aatgacaaac ctttctacat aaatgggcaa aatcaaatta aagtgacaag 660 
gtgttcatct gacatcacag gcatttttgt gtgtgaaaaa ggtccttgcc cccaagaaaa 72 0 
ctgtattatt agtaagcttt ttaaagaagg atgcaccttt atctacaata gcacccaaaa 78 0 
tgcaactgca tcaataatgt tcatgcaaag tttatcttct gtggttgaat tttgtaatgc 84 0 
aagtacccac aaccaagaag caccaaacct acagaaccag atgtgcagcc tcagaagtgc 90 0 
atgggatgta atcacagact ctgctgactt tcaccacagc tttcccatga acgggactga 960 
gcttccacct cctcccacat tctcgcttgt agaggctggt gacaaagtgg tctgtttagt 1020 
gctggatgtg tccagcaaga tggcagaggc tgacagactc cttcaactac aacaagccgc 1080 
agaattttat ttgatgcaga ttgttgaaat tcataccttc gtgggcattg ccagtttcga 1140 
cagcaaagga gagatcagag cccagctaca ccaaattaac agcaatgatg atcgaaagtt 1200 
gctggtttca tatctgccca ccactgtatc agctaaaaca gacatcagca tttgttcagg 12 60 
gcttaagaaa ggatttgagg tggttgaaaa actgaatgga aaagcttatg gctctgtgat 1320 
gatattagtg accagcggag atgataagct tcttggcaat tgcttaccca ctgtgctcag 1380 
cagtggttca acaattcact ccattgccct gggttcatct gcagccccaa atctggagga 14 40 
attatcacgt cttacaggag gtttaaagtt ctttgttcca gatatatcaa actccaatag 1500 
catgattgat gctttcagta gaatttcctc tggaactgga gacattttcc agcaacatat 1560 
tcagcttgaa agtacaggtg aaaatgtcaa acctcaccat caattgaaaa acacagtgac 1620 
tgtggataat actgtgggca acgacactat gtttctagtt acgtggcagg ccagtggtcc 1680 
tcctgagatt atattatttg atcctgatgg acgaaaatac tacacaaata attttatcac 17 4 0 
caatctaact tttcggacag ctagtctttg gattccagga acagctaagc ctgggcact'g 1800 
gacttacacc ctgaacaata cccatcattc tctgcaagcc ctgaaagtga cagtgacctc 18 60 
tcgcgcctcc aactcagctg tgcccccagc cactgtggaa gcctttgtgg aaagagacag 1920 
cctccatttt cctcatcctg tgatgattta tgccaatgtg aaacagggat tttatcccat 1980 
tcttaatgcc actgtcactg ccacagttga gccagagact ggagatcctg ttacgctgag 2040 
actccttgat gatggagcag gtgctgatgt tataaaaaat gatggaattt actcgaggta 2100 
ttttttctcc tttgctgcaa atggtagata tagcttgaaa gtgcatgtca atcactctcc 2160 
cagcataagc accccagccc actctattcc agggagtcat gctatgtatg taccaggtta 2220 
cacagcaaac ggtaatattc agatgaatgc tccaaggaaa tcagtaggca gaaatgagga 22 8 0 
ggagcgaaag tggggcttta gccgagtcag ctcaggaggc tccttttcag tgctgggagt 2340 
tccagctggc ccccaccctg atgtgtttcc accatgcaaa attattgacc tggaagctgt 2400 
aaatagaaga ggaattgacc ctatcttgga cagcacctgg agaagacttt gatcagggcc 24 60 
aggctacaag ctatgaaata agaatgagta aaagtctaca gaatatccaa gatgacttta 252 0 
acaatgctat tttagtaaat acatcaaagc gaaatcctca gcaagctggc atcagggaga 25 8 0 
tatttacgtt ctcaccccaa atttccacga atggacctga acatcagcca aatggagaaa 2640 
cacatgaaag ccacagaatt tatgttgcaa tacgagcaat ggataggaac tccttacagt 27 00 
ctgctgtatc taacattgcc caggcgcctc tgtttattcc ccccaattct gatcctgtac 27 60 
ctgccagaga ttatcttata ttga 27 84 

<210> 169 
<211> 592 
<212> PRT 

<213> Homo sapiens 
<400> 169 

Met Thr Gin Arg Ser lie Ala Gly Pro He Cys Asn Leu Lys Phe Val 
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1 






5 






Thr 


Leu 


Leu 


Val Ala Leu 


Ser 


Ser 








20 






Val 


Gin 


Leu 


Gin Asp Asn 


Gly Tyr 






35 






40 


Pro 


Gin 


Val 


Pro Glu Asn 


Gin 


Asn 




50 






55 




He 


Thr 


Glu 


Ala Ser Phe 


Tyr 


Leu 


65 






70 






Phe 


Phe 


Arg 


Asn He Lys 


He 


Leu 








85 






Asn 


Asn 


Ser 


Lys He Lys 


Gin 


Glu 








100 






Val 


Thr 


Asp 


Trp Tyr Gly 


Ala 


His 






115 






120 


Tyr 


Arg 


Gly 


Cys Gly Lys 


Glu Gly 




130 






135 




Phe 


Leu 


Leu 


Asn Asp Asn 


Leu 


Thr 


145 






150 






Val 


Phe 


Val 


His Glu Trp 


Ala 


His 








165 






Tyr 


Asn 


Asn 


Asp Lys Pro 


Phe 


Tyr 








180 






Val 


Thr 


Arg 


Cys Ser Ser 


Asp 


He 






195 






200 


Gly 


Pro 


Cys 


Pro Gin Glu 


Asn Cys 




210 






215 




Gly 


Cys 


Thr 


Phe He Tyr 


Asn 


Ser 


225 






230 






Met 


Phe 


Met 


Gin Ser Leu 


Ser 


Ser 








245 






Thr 


His 


Asn 


Gin Glu Ala 


Pro 


Asn 








260 






Arg 


Ser 


Ala 


Trp Asp Val 


He 


Thr 






275 






280 


Phe 


Pro 


Met 


Asn Gly Thr 


Glu 


Leu 




290 






295 




Val 


Glu 


Ala 


Gly Asp Lys 


Val 


Val 


305 






310 






Lys 


Met 


Ala 


Glu Ala Asp 


Arg 


Leu 








325 






Phe 


Tyr 


Leu 


Met Gin He 


Val 


Glu 








340 






Ser 


Phe 


Asp 


Ser Lys Gly 


Glu 


He 






355 






360 


Ser 


Asn 


Asp 


Asp Arg Lys 


Leu 


Leu 




370 






375 




Ser 


Ala 


Lys 


Thr Asp He 


Ser 


He 


385 






390 






Glu 


Val 


Val 


Glu Lys Leu 


Asn 


Gly 








405 






Leu 


Val 


Thr 


Ser Gly Asp 


Asp Lys 








420 






Val 


Leu 


Ser 


Ser Gly Ser 


Thr 


He 






435 






440 


Ala 


Ala 


Pro 


Asn Leu Glu 


Glu 


Leu 




450 






455 




Phe 


Phe 


Val 


Pro Asp He 


Ser 


Asn 



10 15 



Glu 


Leu 


Pro 


Phe 


Leu 


Gly 


Ala 


Gly 


25 










30 






Asn Gly Leu 


Leu 


He 


Ala 


He 


Asn 










45 








Leu 


He 


Ser 


Asn 


He 


Lys 


Glu 


Met 








60 










Phe 


Asn 


Ala 


Thr 


Lys 


Arg 


Arg 


Val 






75 










80 


He 


Pro 


Ala 


Thr 


Trp 


Lys 


Ala 


Asn 




90 










95 




Ser 


Tyr 


Glu 


Lys Ala Asn Val 


He 


105 










110 






Gly Asp Asp 


Pro 


Tyr 


Thr 


Leu 


Gin 










125 








Lys 


Tyr 


He 


His 


Phe 


Thr 


Pro 


Asn 








140 










Ala 


Gly 


Tyr 


Gly 


Ser 


Arg 


Gly 


Arg 






155 










160 


Leu 


Arg Trp 


Gly Val 


Phe 


Asp 


Glu 




170 










175 




He Asn Gly 


Gin Asn 


Gin 


He 


Lys 


185 










190 






Thr Gly He 


Phe 


Val 


Cys 


Glu 


Lys 










205 








He 


He 


Ser 


Lys 


Leu 


Phe 


Lys 


Glu 








220 










Thr 


Gin 


Asn 


Ala 


Thr 


Ala 


Ser 


He 






235 










240 


Val 


Val 


Glu 


Phe 


Cys 


Asn 


Ala 


Ser 




250 










255 




Leu 


Gin 


Asn 


Gin 


Met 


Cys 


Ser 


Leu 


265 










270 






Asp 


Ser 


Ala 


Asp 


Phe 


His 


His 


Ser 










285 








Pro 


Pro 


Pro 


Pro 


Thr 


Phe 


Ser 


Leu 








300 










Cys 


Leu 


Val 




Asp 


Val 


Ser 


Ser 






315 










320 


Leu 


Gin 


Leu 


Gin 


Gin 


Ala 


Ala 


Glu 




330 










335 




He 


His 


Thr 


Phe Val Gly He 


Ala 


345 










350 






Arg Ala 


Gin 


Leu 


His 


Gin 


He 


Asn 










365 








Val 


Ser 


Tyr 


Leu 


Pro 


Thr 


Thr 


Val 








380 










Cys 


Ser 


Gly 






Lys 


Gly 


Phe 






395 










400 


Lys 


Ala 


Tyr 


Gly 


Ser 


Val 


Met 


He 




410 










415 




Leu Leu Gly 


Asn Cys 


Leu 


Pro 


Thr 


425 










430 






His 


Ser 


He 


Ala Leu Gly Ser 


Ser 










445 








Ser 


Arg 




Thr 


Gly Gly 


Leu 


Lys 








460 










Ser 


Asn 


Ser 


Met 


He Asp 


Ala 


Phe 
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465 










470 






475 








480 


Ser 


Arg 


He 


Ser 


Ser 


Gly 


Thr 


Gly Asp 


He Phe 


Gin 


Gin His 


He 


Gin 










485 








490 






495 




Leu 


Glu 


Ser 


Thr 


Gly 


Glu 


Asn 


Val Lys 


Pro His 


His 


Gin Leu 


Lys 


Asn 








500 








505 






510 






Thr 


Val 


Thr 


Val 


Asp 


Asn 


Thr 


Val Gly 


Asn Asp 


Thr 


Met Phe 


Leu 


Val 






515 










520 






525 






Thr 


Trp 


Gin 


Ala 


Ser 


Gly 


Pro 


Pro Glu 


He He 


Leu 


Phe Asp 


Pro 


Asp 




530 










535 






540 








Gly 


Arg 


Lys 


Tyr 


Tyr 


Thr 


Asn 


Asn Phe 


He Thr 


Asn 


Leu Thr 


Phe 


Arg 


545 










550 






555 








560 


Thr 


Ala 


Ser 


Leu 


Trp 


He 


Pro Gly Thr 


Ala Lys 


Pro Gly His 


Trp 


Thr 










565 








570 






575 




Tyr 


Thr 


Leu Met Cys 


Phe 


His 


His Ala 


Lys Leu 


Leu 


Thr Trp 


Lys 


Leu 








580 








585 






590 







<210> 170 
<211> 791 
<212> PRT 

<213> Homo sapiens 



<400> 170 



Met 


Thr 


Gin 


Arg 


Ser 


He Ala Gly Pro 


lie Cys 


Asn Leu 


Lys 


Phe 


Val 


1 








5 










10 






15 




Thr 


Leu 


Leu 


Val 


Ala 


Leu 


Ser 


Ser 


Glu 


Leu Pro 


Phe Leu Gly 


Ala 


Gly 








20 










25 






30 






Val 


Gin 


Leu 


Gin 


Asp 


Asn Gly Tyr Asn 


Gly Leu 


Leu He 


Ala 


He 


Asn 






35 










40 






45 








Pro 


Gin 


Val 


Pro 


Glu 


Asn 


Gin 


Asn 


Leu 


He Ser 


Asn He 


Lys 


Glu 


Met 




50 










55 








60 






He 


Thr 


Glu 


Ala 


Ser 


Phe 


Tyr 


Leu 


Phe 


Asn Ala 


Thr Lys 


Arg 


Arg 


Val 


65 










70 








75 








80 


Phe 


Phe 


Arg 


Asn 


He 


Lys 


He 


Leu 


He 


Pro Ala 


Thr Trp 




Ala 


Asn 










85 










90 






95 




Asn 


Asn 


Ser 


Lys 


He 


Lys 


Gin 


Glu 


Ser 


Tyr Glu 


Lys Ala 


Asn 


Val 


He 








100 










105 






110 






Val 


Thr 


Asp 


Trp 


Tyr 


Gly 


Ala 


His 


Gly 


Asp Asp 


Pro Tyr 


Thr 


Leu 


Gin 






115 










120 






125 








Tyr 


Arg 


Gly 


Cys 


Gly 


Lys 


Glu 


Gly 


Lys 


Tyr He 


His Phe 


Thr 


Pro 


Asn 




130 










135 








140 








Phe 


Leu 




Asn 


Asp 


Asn 


Leu 


Thr 


Ala 


Gly Tyr 


Gly Ser Arg 


Gly Arg 


145 










150 








155 








160 


Val 


Phe 


Val 


His 


Glu 


Trp 


Ala 


His 


Leu 


Arg Trp 


Gly Val 


Phe 


Asp 


Glu 










165 










170 






175 




Tyr 


Asn 


Asn 


Asp 


Lys 


Pro 


Phe 


Tyr 


lie 


Asn Gly 


Gin Asn 


Gin 


He 


Lys 








180 










185 






190 






Val 


Thr 


Arg 


Cys 


Ser 


Ser 


Asp 


He 


Thr 


Gly He 


Phe Val 


Cys 


Glu 


Lys 






195 










200 






205 








Gly 


Pro 


Cys 


Pro 


Gin 


Glu Asn Cys 


He 


He Ser 


Lys Leu 


Phe 


Lys 


Glu 




210 










215 








220 








Gly 


Cys 


Thr 


Phe 


He 


Tyr 


Asn 


Ser 


Thr 


Gin Asn 


Ala Thr 


Ala 


Ser 


He 


225 










230 








235 








240 


Met 


Phe 


Met 


Gin 


Ser 


Leu 


Ser 


Ser 


Val 


Val Glu 


Phe Cys 


Asn 


Ala 


Ser 










245 










250 






255 




Thr 


His 


Asn 


Gin 


Glu 


Ala 


Pro 


Asn 


Leu 


Gin Asn 


Gin Met 


Cys 


Ser 


Leu 








260 










265 






270 






Arg 


Ser 


Ala 


Trp 


Asp 


Val 


He 


Thr 


Asp 


Ser Ala 


Asp Phe 


His 


His 


Ser 
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275 










280 


Phe 


Pro 


Met 


Asn 


Gly 


Thr 


Glu 


Leu 




290 










295 




Val 


Glu 


Ala 


Gly 


Asp 


Lys 


Val 


Val 


305 










310 






Lys 


Met 


Ala 


Glu 


Ala 


Asp 


Arg 


Leu 










325 








Phe 


Tyr 


Leu 


Met 


Gin 


He 


Val 


Glu 








340 










Ser 


Phe 


Asp 


Ser 


Lys 


Gly 


Glu 


He 






355 










360 


Ser 


Asn 


Asp 


Asp 


Arg 


Lys 


Leu 


Leu 




370 










375 




Ser 


Ala 


Lys 


Thr 


Asp 


He 


Ser 


He 


385 










390 






Glu 


Val 


Val 


Glu 


Lys 


Leu 


Asn 


Gly 










405 








Leu 


Val 


Thr 


Ser 


Gly 


Asp 


Asp 


Lys 








420 










Val 


Leu 


Ser 


Ser 


Gly 


Ser 


Thr 


He 






435 










440 


Ma 


Ala 


Pro 


Asn 


Leu 


Glu 


Glu 


Leu 




450 










455 




Phe 


Phe 


Val 


Pro 


Asp 


He 


Ser 


Asn 


465 










470 






Ser 


Arg 


He 


Ser 


Ser 


Gly 


Thr 


Gly 










485 








Leu 


Glu 


Ser 


Thr 


Gly 


Glu 


Asn 


Val 








500 










Thr 


Val 


Thr 


Val 


Asp 


Asn 


Thr 


Val 






515 










520 


Thr 


Trp 


Gin 


Ala 


Ser 


Gly 


Pro 


Pro 




530 










535 




Gly 


Arg 


Lys 


Tyr 


Tyr 


Thr 


Asn 


Asn 


545 










550 






Thr 


Ala 


Ser 


Leu 


Trp 


He 


Pro 


Gly 










565 








Tyr 


Thr 


Leu 


Asn 


Asn 


Thr 


His 


His 








580 










Val 


Thr 


Ser 


Arg 


Ala 


Ser 


Asn 


Ser 






595 










600 


Ala 


Phe 


Val 


Glu 


Arg 


Asp 


Ser 


Leu 




610 










615 




Tyr 


Ala 


Asn 


Val 


Lys 


Gin 


Gly 


Phe 


625 










630 






Thr 


Ala 


Thr 


Val 


Glu 


Pro 


Glu 


Thr 










645 








Leu 


Asp 


Asp 


Gly 


Ala 


Gly 


Ala 


Asp 








660 










Ser 


Arg 


Tyr 


Phe 


Phe 


Ser 


Phe 


Ala 






675 










680 


Val 


His 


Val 


Asn 


His 


Ser 


Pro 


Ser 




690 










695 




Pro 


Gly 


Ser 


His 


Ala 


Met 


Tyr 


Val 


705 










710 






He 


Gin 


Met 


Asn 


Ala 


Pro 


Arg 


Lys 










725 








Arg 


Lys 


Trp 


Gly 


Phe 


Ser 


Arg 


Val 



285 



Pro 


Pro 


Pro 


Pro Thr 


Phe 


Ser 


Leu 








300 








Cys 


Leu 


Val 


Leu Asp 


Val 


Ser 


Ser 






315 








320 


Leu 


Gin 


Leu 


Gin Gin 


Ala 


Ala 


Glu 




330 








335 




He 


His 


Thr 


Phe Val 


Gly 


He 


Ala 


345 








350 






Arg 


Ala 


Gin 


Leu His 


Gin 


He 


Asn 








365 








Val 


Ser 


Tyr Leu Pro 


Thr 


Thr 


Val 








380 








Cys 


Ser 


Gly Leu Lys 


Lys 


Gly 


Phe 






395 








400 


Lys 


Ala 


Tyr Gly Ser 


Val 


Met 


He 




410 








415 




Leu 




Gly 


Asn Cys 


Leu 


Pro 


Thr 


425 








430 






His 


Ser 


He 


Ala Leu 


Gly 


Ser 


Ser 








445 








Ser 


Arg 


Leu Thr Gly 


Gly 


Leu 


Lys 








460 








Ser 


Asn 


Ser 


Met He 


Asp 


Ala 


Phe 






475 








480 


Asp 


He 


Phe 


Gin Gin 


His 


He 


Gin 




490 








495 




Lys 


Pro 


His 


His Gin 


Leu 


Lys 


Asn 


505 








510 






Gly 


Asn 


Asp 


Thr Met 


Phe 


Leu 


Val 








525 








Glu 


He 


He 


Leu Phe 


Asp 


Pro 


Asp 








540 








Phe 


He 


Thr 


Asn Leu 


Thr 


Phe 


Arg 






555 








560 


Thr 


Ala 


Lys 


Pro Gly 


His 


Trp 


Thr 




570 








575 




Ser 


Leu 


Gin 


Ala Leu 


Lys 


Val 


Thr 


585 








590 






Ala 


Val 


Pro 


Pro Ala 


Thr 


Val 


Glu 








605 








His 


Phe 


Pro 


His Pro 


Val 


Met 


He 








620 








Tyr 


Pro 


He 


Leu Asn 


Ala 


Thr 


Val 






635 








640 


Gly 


Asp 


Pro 


Val Thr 


Leu 


Arg 


Leu 




650 








655 




Val 


He 


Lys Asn Asp 


Gly 


He 


Tyr 


665 








670 






Ala 


Asn 


Gly Arg Tyr 


Ser 


Leu 


Lys 


He 


Ser 


Thr 


Pro Ala 


His 


Ser 


He 








700 








Pro 


Gly 


Tyr 


Thr Ala 


Asn 


Gly 


Asn 






715 








720 


Ser 


Val 


Gly Arg Asn 


Glu 


Glu 


Glu 




730 








735 




Ser 


Ser 


Gly Gly Ser 


Phe 


Ser 


Val 
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740 745 750 

Leu Gly Val Pro Ala Gly Pro His Pro Asp Val Phe Pro Pro Cys Lys 

755 760 765 

lie lie Asp Leu Glu Ala Val Asn Arg Arg Gly lie Asp Pro lie Leu 

770 775 780 

Asp Ser Thr Trp Arg Arg Leu 
785 790 



<210> 171 
<211> 1491 
<212> DNA 
<213> Homo sapiens 

<400> 171 

cctcctgcca gccaagtgaa gacatgctta cttccccttc accttccttc atgatgtggg 60 

aagagtgctg caacccagcc ctagccaacg ccgcatgaga gggagtgtgc cgagggcttc 120 

tgagaaggtt tctctcacat ctagaaagaa gcgcttaaga tgtggcagcc cctcttcttc 180 

aagtggctct tgtcctgttg ccctgggagt tctcaaattg ctgcagcagc ctccacccag 240 

cctgaggatg acatcaatac acagaggaag aagagtcagg aaaagatgag agaagttaca 300 

gactctcctg ggcgaccccg agagcttacc attcctcaga cttcttcaca tggtgctaac 360 

agatttgttc ctaaaagtaa agctctagag gccgtcaaat tggcaataga agccgggttc 420 

caccatattg attctgcaca tgtttacaat aatgaggagc aggttggact ggccatccga 480 

agcaagattg cagatggcag tgtgaagaga gaagacatat tctacacttc aaagctttgg 540 

agcaattccc atcgaccaga gttggtccga ccagccttgg aaaggtcact gaaaaatctt 600 

caattggact atgttgacct ctatcttatt cattttccag tgtctgtaaa gccaggtgag 660 

gaagtgatcc caaaagatga aaatggaaaa atactatttg acacagtgga tctctgtgcc 720 

acatgggagg ccatggagaa gtgtaaagat gcaggattgg ccaagtccat cggggtgtcc 7 80 

aacttcaacc acaggctgct ggagatgatc ctcaacaagc cagggctcaa gtacaagcct 840 

gtctgcaacc aggtggaatg tcatccttac ttcaaccaga gaaaactgct ggatttctgc 900 

aagtcaaaag acattgttct ggttgcctat agtgctctgg gatcccatcg agaagaacca 960 

tgggtggacc cgaactcccc ggtgctcttg gaggacccag tcctttgtgc cttggcaaaa 1020 

aagcacaagc gaaccccagc cctgattgcc ctgcgctacc agctgcagcg tggggttgtg 1080 

gtcctggcca agagctacaa tgagcagcgc atcagacaga acgtgcaggt gtttgaattc 114 0 

cagttgactt cagaggagat gaaagccata gatggcctaa acagaaatgt gcgatatttg 1200 

acccttgata tttttgctgg cccccctaat tatccatttt ctgatgaata ttaacatgga 12 60 

gggcattgca tgaggtctgc cagaaggccc tgcgtgtgga tggtgacaca gaggatggct 1320 

ctatgctggt gactggacac atcgcctctg gttaaatctc tcctgcttgg cgacttcagt 138 0 

aagctacagc taagcccatc ggccggaaaa gaaagacaat aattttgttt ttcattttga 1440 

aaaaattaaa tgctctctcc taaagattct tcacctaaaa aaaaaaaaaa a 1491 

<210> 172 
<211> 364 
<212> PRT 
<213> Homo sapiens 

<400> 172 

Met Trp Gin Pro Leu Phe Phe Lys Trp Leu Leu Ser Cys Cys Pro' Gly 

15 10 15 

Ser Ser Gin lie Ala Ala Ala Ala Ser Thr Gin Pro Glu Asp Asp He 

20 25 30 

Asn Thr Gin Arg Lys Lys Ser Gin Glu Lys Met Arg Glu Val Thr Asp 

35 40 45 

Ser Pro Gly Arg Pro Arg Glu Leu Thr He Pro Gin Thr Ser Ser His 

50 55 60 

Gly Ala Asn Arg Phe Val Pro Lys Ser Lys Ala Leu Glu Ala Val Lys 
65 70 75 80 

Leu Ala He Glu Ala Gly Phe His His He Asp Ser Ala His Val Tyr 
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85 90 95 



Asn 


Asn 


Glu 


Glu 


Gin 


Val Gly Leu Ala He 


Arg 


Ser 


Lys He 


Ala 


Asp 








100 




105 






110 






Gly 


Ser 


Val 


Lys 


Arg 


Glu Asp He Phe Tyr 


Thr 


Ser 


Lys Leu 


Trp 


Ser 






115 






120 






125 






Asn 


Ser 


His 


Arg 


Pro 


Glu Leu Val Arg Pro 


Ala 


Leu 


Glu Arg 


Ser 


Leu 




130 








135 




140 








Lys 


Asn 


Leu 


Gin 




Asp Tyr Val Asp Leu 


Tyr 


Leu 


He His 


Phe 


Pro 


145 










150 


155 








160 


Val 


Ser 


Val 


Lys 


Pro 


Gly Glu Glu Val He 


Pro 


Lys 


Asp Glu 




Gly 










165 


170 








175 




Lys 


He 


Leu 


Phe Asp 


Thr Val Asp Leu Cys 


Ala 


Thr 


Trp Glu 


Ala 


Met 








180 




185 






190 






Glu 


Lys 


Cys 


Lys 


Asp 


Ala Gly Leu Ala Lys 


Ser 


He 


Gly Val 


Ser 


Asn 






195 






200 






205 






Phe 


Asn 


His 


Arg Leu 


Leu Glu Met He Leu 


Asn 


Lys 


Pro Gly Leu 


Lys 




210 








215 




220 








Tyr 


Lys 


Pro 


Val 


Cys 


Asn Gin Val Glu Cys 


His 


Pro 


Tyr Phe 


Asn 


Gin 


225 










230 


235 








240 


Arg 


Lys 


Leu 


Leu 


Asp 


Phe Cys Lys Ser Lys 


Asp 


He 


Val Leu 


Val 


Ala 










245 


250 








255 




Tyr 


Ser 


Ala 


Leu 


Gly 


Ser His Arg Glu Glu 


Pro 


Trp Val Asp 


Pro 


Asn 








260 




265 






270 






Ser 


Pro 


Val 


Leu 


Leu 


Glu Asp Pro Val Leu 


Cys 


Ala 


Leu Ala 


Lys 


Lys 






275 






280 






285 






His 


Lys 


Arg 


Thr 


Pro 


Ala Leu He Ala Leu 


Arg 


Tyr 


Gin Leu 


Gin 


Arg 




290 








295 




300 








Gly 


Val 


Val 


Val 


Leu 


Ala Lys Ser Tyr Asn 


Glu Gin Arg He Arg 


Gin 


305 










310 


315 








320 


Asn 


Val 


Gin 


Val 


Phe 


Glu Phe Gin Leu Thr 


Ser 


Glu 


Glu Met 


Lys 


Ala 










325 


330 








335 




He 


Asp 


Gly 


Leu 


Asn 


Arg Asn Val Arg Tyr 


Leu 


Thr 


Leu Asp 


He 


Phe 








340 




345 






350 






Ala 


Gly 


Pro 


Pro 


Asn 


Tyr Pro Phe Ser Asp 


Glu 


Tyr 












355 






360 













<210> 173 
<211> 1988 
<212> DNA 

<213> Homo sapiens 
<400> 173 

cgggagccgc ctccccgcgg cctcttcgct 
tctctgctgt cgcccgtccc gcgcgctcct 
ccgcgccgcc cgtcaacatg atccgctgcg 
tgcccctgct cctactcagc gccatcgcct 
ggttgcagtc tagcgaccac ggccagacgt 
gcggcggcag cgggtcctac gaggagggct 
gagcagcggc tgccatgctc ttctgtggct 
ccttcttcgc cctctgtgga ccccagatgc 
ttgccttggc tgctgtgttc cagatcatct 
agaccttcac ccttcatgcc aaccctgctg 
ttgggtgggc agccacgatt atcctgatcg 
actacgaaga tgaccttctg ggcaatgcca 
ttgggaatga atgtgggaga aaatcgctgc 
tttctccagg cgactttgaa cccatttttt 
aatgctaaaa taatttggga gaaaatattt 



tttgtggcgg cgcccgcgct cgcaggccac 60 
ccgacccgct ccgctccgct ccgctcggcc 120 
gcctggcctg cgagcgctgc cgctggatcc 18 0 
tcgacatcat cgcgctggcc ggccgcggct 240 
cctcgctgtg gtggaaatgc tcccaagagg 300 
gtcagagcct catggagtac gcgtggggta 360 
tcatcatcct ggtgatctgt ttcatcctct 42 0 
ttgtcttcct gagagtgatt ggaggtctcc 480 
ccctggtaat ttaccccgtg aagtacaccc 540 
tcacttacat ctataactgg gcctacggct 600 
gctgtgcctt cttcttctgc tgcctcccca 660 
agcccaggta cttctacaca tctgcctaac 720 
tgctgagatg gactccagaa gaagaaactg 78 0 
ggcagtgttc atattattaa actagtcaaa 840 
tttaagtagt gttatagttt catgtttatc 900 
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ttttattatg ttttgtgaag ttgtgtcttt 
ccttatatct atccataaca tttatactac 
acactttata aggtaaaaat gaggtttcca 
tttccaaata gaatggactt ggtctgttaa 
aagttgttaa tgaccaaaca ttctaaaaga 
tcgaactatt taaggaaagc aaaatcattt 
cattaatatc ctgaatcatt catttcagct 
aggaaagtac tatttcatgg tccaaacctg 
tgtgaaatat ttagatgaaa ttttctcttt 
aaatgctata ttaataaatc tgtagtgttt 
tggattgaaa gatggactgg gtctaattta 
agtaaagcat taggagggtc attcytgtca 
ataaatgact tgcttttcta aatctcaggt 
ctgatagttt gcarctgtaa gcagaaacct 
taaacagatt ttaaatgtct gatataaaac 
tctctgaata gcatatatat gatgcatcgg 
cttacataat gaaaaccaat tcattttaaa 
aaagctaatt gtagttttca ttatgaagtt 
aaaaaaaa 



tcactaatta cctatactat gccaatattt 960 
atttgtaaga gaatatgcac gtgaaactta 1020 
agatttaata atctgatcaa gttcttgtta 108 0 
gggctaagga gaagaggaag ataaggttaa 1140 
aatgcaaaaa aaaagtttat tttcaagcct 1200 
cctaaatgca tatcatttgt gagaatttct 12 60 
aaggcttcat gttgactcga tatgtcatct 132 0 
ttgccatagt tggtaaggct ttcctttaag 1380 
taaagttctt tatagggtta gggtgtggga 144 0 
tgtgtttata tgttcagaac cagagtagac 1500 
tcatgactga tagatctggt taagttgtgt 1560 
caaaagtgcc actaaaacag cctcaggaga 1620 
ttatctgggc tctatcatat agacaggctt 1680 
acatatagtt aaaatcctgg tctttcttgg 17 4 0 
atgccacagg agaattcggg gatttgagtt 1800 
ataggtcatt atgatttttt accatttcga 1860 
tatcagatta ttattttgta agttgtggaa 1920 
ttcccaataa accaggtatt ctaaaaaaaa 1980 
1988 



<210> 174 
<211> 238 
<212> PRT 

<213> Homo sapiens 



<400> 174 



Gly 


Ala 


Ala 


Ser 


Pro 


Arg 


Pro 


Leu 


Arg 


Phe 


Cys 


Gly 


Gly 


Ala 


Arg 


Ala 


1 








5 










10 










15 




Arg 


Arg 


Pro 


Leu 

20 


Ser 


Ala 


Val 


Ala 


Arg 
25 


Pro 


Ala 


Arg 


Ser 


Ser 
30 


Asp 


Pro 


Leu 


Arg 


Ser 
35 


Ala 


Pro 


Leu 


Gly 


Pro 
40 


Ala 


Pro 


Pro 


Val 


Asn 
45 


Met 


He 


Arg 


Cys 


Gly 
50 


Leu 


Ala 


Cys 


Glu 


Arg 
55 


Cys 


Arg 


Trp 


He 


Leu 
60 


Pro 


Leu 


Leu 


Leu 


Leu 


Ser 


Ala 


He 


Ala 


Phe 


Asp 


He 


He 


Ala 


Leu 


Ala 


Gly 


Arg 


Gly 


Trp 


65 










70 










75 










80 


Leu 


Gin 


Ser 


Ser 


Asp 
85 


His 


Gly 


Gin 


Thr 


Ser 
90 


Ser 


Leu 


Trp 


Trp 


Lys 
95 


Cys 


Ser 


Gin 


Glu 


Gly 
100 


Gly 


Gly 


Ser 


Gly 


Ser 
105 


Tyr 


Glu 


Glu 


Gly 


Cys 
110 


Gin 


Ser 


Leu 


Met 


Glu 


Tyr 


Ala 


Trp 


Gly 


Arg 


Ala 


Ala 


Ala 


Ala 


Met 


Leu 


Phe 


Cys 






115 








120 










125 








Gly 


Phe 
130 


He 


He 


Leu 


Val 


He 
135 


Cys 


Phe 


He 


Leu 


Ser 
140 


Phe 


Phe 


Ala 


Leu 


Cys 


Gly 


Pro 


Gin 


Met 




Val 


Phe 




Arg 


Val 


He 


Gly 


Gly 




Leu 


145 










150 










155 










160 


Ala 


Leu 


Ala 


Ala 


Val 
165 


Phe 


Gin 


He 


He 


Ser 
170 


Leu 


Val 


He 


Tyr 


Pro 
175 


Val 


Lys 


Tyr 


Thr 


Gin 
180 


Thr 


Phe 


Thr 


Leu 


His 
185 


Ala 


Asn 


Pro 


Ala 


Val 
190 


Thr 


Tyr 


He 


Tyr 


Asn 
195 


Trp 


Ala 


Tyr 


Gly 


Phe 

200 


Gly 


Trp 


Ala 


Ala 


Thr 
205 


He 


He 


Leu 


He 


Gly 
210 


Cys 


Ala 


Phe 


Phe 


Phe 
215 


Cys 


Cys 


Leu 


Pro 


Asn 

220 


Tyr 


Glu 


Asp 


Asp 


Leu 


Leu 


Gly 


Asn 


Ala 


Lys 


Pro 


Arg 


Tyr 


Phe 


Tyr 


Thr 


Ser 


Ala 







225 230 235 
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<210> 175 . 
<211> 4181 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 3347, 3502, 3506, 3520, 3538, 3549, 3646, 3940, 3968, 3974, 
4036, 4056, 4062, 4080, 4088, 4115 
<223> n = A,T,C or G 

<400> 175 

ggtggatgcg tttgggttgt agctaggctt tttcttttct ttctctttta aaacacatct 60 
agacaaggaa aaaacaagcc tcggatctga tttttcactc ctcgttcttg tgcttggttc 120 
ttactgtgtt tgtgtatttt aaaggcgaga agacgagggg aacaaaacca gctggatcca 180 
tccatcaccg tgggtggttt taatttttcg ttttttctcg ttattttttt ttaaacaacc 240 
actcttcaca atgaacaaac tgtatatcgg aaacctcagc gagaacgccg ccccctcgga 300 
cctagaaagt atcttcaagg acgccaagat cccggtgtcg ggacccttcc tggtgaagac 360 
tggctacgcg ttcgtggact gcccggacga gagctgggcc ctcaaggcca tcgaggcgct 420 
ttcaggtaaa atagaactgc acgggaaacc catagaagtt gagcactcgg tcccaaaaag 48 0 
gcaaaggatt cggaaacttc agatacgaaa tatcccgcct catttacagt gggaggtgct 540 
ggatagttta ctagtccagt atggagtggt ggagagctgt gagcaagtga acactgactc 600 
ggaaactgca gttgtaaatg taacctattc cagtaaggac caagctagac aagcactaga 660 
caaactgaat ggatttcagt tagagaattt caccttgaaa gtagcctata tccctgatga 72 0 
aatggccgcc cagcaaaacc ccttgcagca gccccgaggt cgccgggggc ttgggcagag 780 
gggctcctca aggcaggggt ctccaggatc cgtatccaag cagaaaccat gtgatttgcc 840 
tctgcgcctg ctggttccca cccaatttgt tggagccatc ataggaaaag aaggtgccac 900 
cattcggaac atcaccaaac agacccagtc taaaatcgat gtccaccgta aagaaaatgc 960 
gggggctgct gagaagtcga ttactatcct ctctactcct gaaggcacct ctgcggcttg 1020 
taagtctatt ctggagatta tgcataagga agctcaagat ataaaattca cagaagagat 1080 
ccccttgaag attttagctc ataataactt tgttggacgt cttattggta aagaaggaag 1140 
aaatcttaaa aaaattgagc aagacacaga cactaaaatc acgatatctc cattgcagga 1200 
attgacgctg tataatccag aacgcactat tacagttaaa ggcaatgttg agacatgtgc 1260 
caaagctgag gaggagatca tgaagaaaat cagggagtct tatgaaaatg atattgcttc 1320 
tatgaatctt caagcacatt taattcctgg attaaatctg aacgccttgg gtctgttccc 1380 
acccacttca gggatgccac ctcccacctc agggccccct tcagccatga ctcctcccta 1440 
cccgcagttt gagcaatcag aaacggagac tgttcatcag tttatcccag ctctatcagt 1500 
cggtgccatc atcggcaagc agggccagca catcaagcag ctttctcgct ttgctggagc 1560 
ttcaattaag attgctccag cggaagcacc agatgctaaa gtgaggatgg tgattatcac 1620 
tggaccacca gaggctcagt tcaaggctca gggaagaatt tatggaaaaa ttaaagaaga 1680 
aaactttgtt agtcctaaag aagaggtgaa acttgaagct catatcagag tgccatcctt 1740 
tgctgctggc agagttattg gaaaaggagg caaaacggtg aatgaacttc agaatttgtc 1800 
aagtgcagaa gttgttgtcc ctcgtgacca gacacctgat gagaatgacc aagtggttgt 18 60 
caaaataact ggtcacttct atgcttgcca ggttgcccag agaaaaattc aggaaattct 1920 
gactcaggta aagcagcacc aacaacagaa ggctctgcaa agtggaccac ctcagtcaag 1980 
acggaagtaa aggctcagga aacagcccac cacagaggca gatgccaaac caaagacaga 2040 
ttgcttaacc aacagatggg cgctgacccc ctatccagaa tcacatgcac aagtttttac 2100 
ctagccagtt gtttctgagg accaggcaac ttttgaactc ctgtctctgt gagaatgtat 2160 
actttatgct ctctgaaatg tatgacaccc agctttaaaa caaacaaaca aacaaacaaa 2220 
aaaagggtgg gggagggagg gaaagagaag agctctgcac ttccctttgt tgtagtctca 22 80 
cagtataaca gatattctaa ttcttcttaa tattccccca taatgccaga aattggctta 2340 
atgatgcttt cactaaattc atcaaataga ttgctcctaa atccaattgt taaaattgga 2400 
tcagaataat tatcacagga acttaaatgt taagccatta gcatagaaaa actgttctca 2460 
gttttatttt tacctaacac taacatgagt aacctaaggg aagtgctgaa tggtgttggc 2520 
aggggtatta aacgtgcatt tttactcaac tacctcaggt attcagtaat acaatgaaaa 2580 
gcaaaattgt tccttttttt tgaaaatttt atatacttta taatgataga agtccaaccg 2640 
ttttttaaaa aataaattta aaatttaaca gcaatcagct aacaggcaaa ttaagatttt 2700 
tacttctggc tggtgacagt aaagctggaa aattaatttc agggtttttt gaggcttttg 27 60 
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acacagttat tagttaaatc aaatgttcaa aaatacggag cagtgcctag tatctggaga 2820 
gcagcactac catttattct ttcatttata gttgggaaag tttttgacgg tactaacaaa 2880 
gtggtcgcag gagattttgg aacggctggt ttaaatggct tcaggagact tcagtttttt 2 940 
gtttagctac atgattgaat gcataataaa tgctttgtgc ttctgactat caatacctaa 3000 
agaaagtgca tcagtgaaga gatgcaagac tttcaactga ctggcaaaaa gcaagcttta 3060 
gcttgtctta taggatgctt agtttgccac tacacttcag accaatggga cagtcataga 3120 
tggtgtgaca gtgtttaaac gcaacaaaag gctacatttc catggggcca gcactgtcat 3180 
gagcctcact aagctatttt gaagattttt aagcactgat aaattaaaaa aaaaaaaaaa 3240 
aaattagact ccaccttaag tagtaaagta taacaggatt tctgtatact gtgcaatcag 3300 
ttctttgaaa aaaaagtcaa aagatagaga atacaagaaa agttttnggg atataatttg 33 60 
aatgactgtg aaaacatatg acctttgata acgaactcat ttgctcactc cttgacagca 3420 
aagcccagta cgtacaattg tgttgggtgt gggtggtctc caaggccacg ctgctctctg 3480 
aattgatttt ttgagttttg gnttgnaaga tgatcacagn catgttacac tgatcttnaa 3540 
ggacatatnt tataaccctt taaaaaaaaa atcccctgcc tcattcttat ttcgagatga 3600 
atttcgatac agactagatg tctttctgaa gatcaattag acattntgaa aatgatttaa 3660 
agtgttttcc ttaatgttct ctgaaaacaa gtttcttttg tagttttaac caaaaaagtg 3720 
ccctttttgt cactggtttc tcctagcatt catgattttt ttttcacaca atgaattaaa 37 80 
attgctaaaa tcatggactg gctttctggt tggatttcag gtaagatgtg tttaaggcca 3840 
gagcttttct cagtatttga tttttttccc caatatttga ttttttaaaa atatacacat 3900 
aggagctgca tttaaaacct gctggtttaa attctgtcan atttcacttc tagcctttta 3960 
gtatggcnaa tcanaattta cttttactta agcatttgta atttggagta tctggtacta 4020 
gctaagaaat aattcnataa ttgagttttg tactcnccaa anatgggtca ttcctcatgn 4080 
ataatgtncc cccaatgcag cttcattttc caganacctt ga'cgcaggat aaattttttc 414 0 
atcatttagg tccccaaaaa aaaaaaaaaa aaaaaaaaaa a 4181 

<210> 176 
<211> 579 
<212> PRT 

<213> Homo sapiens 



<400> 176 



Met 


Asn 


Lys 


Leu 


Tyr 


He 


Gly 


Asn 


Leu 


Ser 


Glu 


Asn 


Ala 


Ala 


Pro 


Ser 


1 








5 










10 










15 




Asp 


Leu 


Glu 


Ser 
20 


He 


Phe 


Lys 


Asp 


Ala 
25 


Lys 


He 


Pro 


Val 


Ser 
30 


Gly 


Pro 


Phe 


Leu 


Val 


Lys 


Thr 


Gly 


Tyr 


Ala 


Phe 


Val 


Asp 


Cys 


Pro Asp 


Glu 


Ser 






35 










40 










45 








Trp 


Ala 
50 . 




Lys 


Ala 


He 


Glu 
55 


Ala 


Leu 


Ser 


Gly 


Lys 
60 


He 


Glu 


Leu 


His 


Gly 


Lys 


Pro 


He 


Glu 


Val 


Glu 


His 


Ser 


Val 


Pro 


Lys 


Arg 


Gin 


Arg 


He 


65 










70 










75 










80 


Arg 


Lys 


Leu 


Gin 


He 
85 


Arg 


Asn 


He 


Pro 


Pro 
90 


His 


Leu 


Gin 


Trp 


Glu 
95 


Val 


Leu 


Asp 


Ser 


Leu 
100 


Leu 


Val 


Gin 


Tyr 


Gly 
105 


Val 


Val 


Glu 


Ser 


Cys 
110 


Glu 


Gin 


Val 


Asn 


Thr 
115 


Asp 


Ser 


Glu 


Thr 


Ala 
120 


Val 


Val 


Asn 


Val 


Thr 
125 


Tyr 


Ser 


Ser 


Lys 


Asp 
130 


Gin 


Ala 


Arg 


Gin 


Ala 
135 




Asp 


Lys 


Leu 


Asn 
140 


Gly 


Phe 


Gin 


Leu 


Glu 


Asn 


Phe 


Thr 


Leu 




Val 


Ala 


Tyr 


He 


Pro 




Glu 


Met 


Ala 


Ala 


145 










150 










155 










160 


Gin 


Gin 


Asn 


Pro 


Leu 


Gin 


Gin 


Pro 


Arg 


Gly 


Arg 


Arg 


Gly Leu 


Gly 


Gin 










165 










170 










175 




Arg 


Gly 


Ser 


Ser 


Arg 


Gin 


Gly 


Ser 


Pro 


Gly 


Ser 


Val 


Ser 


Lys 


Gin Lys 








180 










185 










190 






Pro 


Cys 


Asp 
195 


Leu 


Pro 




Arg 


Leu 

200 


Leu 


Val 


Pro 


Thr 


Gin 
205 


Phe 


Val 


Gly 


Ala 


He 


He 


Gly 


Lys 


Glu 


Gly 


Ala 


Thr 


He 


Arg 


Asn 


He 


Thr 


Lys 


Gin 
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210 








215 




Thr 


Gin 


Ser 


Lys lie 


Asp Val 


His 


225 








230 






Glu 


Lys 


Ser 


lie Thr 


He 


Leu 


Ser 








245 








Cys 


Lys 


Ser 


lie Leu 


Glu 


He 


Met 








260 








Phe 


Thr 


Glu 


Glu lie 


Pro 


Leu 


Lys 






275 








280 


Gly 


Arg 


Leu 


lie Gly 


Lys 


Glu 


Gly 




290 








295 




Asp 


Thr 


Asp 


Thr Lys 


He 


Thr 


He 


305 








310 






Tyr 


Asn 


Pro 


Glu Arg 


Thr 


He 


Thr 








325 








Ala 


Lys 


Ala 


Glu Glu 


Glu 


He 


Met 








340 








Asn 


Asp 


lie 


Ala Ser 


Met 


Asn 


Leu 






355 








360 


Asn 


Leu 


Asn 


Ala Leu 


Gly Leu 


Phe 




370 








375 




Pro 


Thr 


Ser Gly Pro 


Pro 


Ser 


Ala 


385 








390 






Glu 


Gin 


Ser 


Glu Thr 


Glu 


Thr 


Val 








405 








Val 


Gly 


Ala 


He He 


Gly 


Lys 


Gin 








420 








Arg 


Phe 


Ala 


Gly Ala 


Ser 


He 


Lys 






435 








440 


Ala 


Lys 


Val 


Arg Met 


Val 


He 


He 




450 








455 




Lys 


Ala 


Gin 


Gly Arg 


He 


Tyr 


Gly 


465 








470 






Ser 


Pro 


Lys 


Glu Glu 


Val 




Leu 








485 








Phe 


Ala 


Ala 


Gly Arg 


Val 


He 


Gly 








500 








Leu 


Gin 


Asn 


Leu Ser 


Ser 


Ala 


Glu 






515 








520 


Pro 


Asp 


Glu Asn Asp 


Gin 


Val 


Val 




530 








535 




Ala 


Cys 


Gin 


Val Ala 


Gin 


Arg 




545 








550 






Lys 


Gin 


His 


Gin Gin 


Gin 


Lys 


Ala 








565 








Arg 


Arg 


Lys 











220 



Arg 


Lys 


Glu Asn Ala Gly Ala 


Ala 






235 






240 


Thr 


Pro 


Glu Gly Thr Ser 


Ala 


Ala 




250 






255 




His 


Lys 


Glu 


Ala Gin Asp 


He 


Lys 


265 






270 






He 


Leu 


Ala 


His Asn Asn 


Phe 


Val 








285 






Arg 


Asn 


Leu 


Lys Lys He 


Glu 


Gin 








300 






Ser 


Pro 


Leu 


Gin Glu Leu 


Thr 


Leu 






315 






320 


Val 


Lys 


Gly Asn Val Glu Thr 


Cys 




330 






335 




Lys 


Lys 


He Arg Glu Ser 


Tyr 


Glu 


345 






350 






Gin 


Ala 


His 


Leu He Pro 


Gly 


Leu 








365 






Pro 


Pro 


Thr 


Ser Gly Met 


Pro 


Pro 








380 






Met 


Thr 


Pro 


Pro Tyr Pro 


Gin 


Phe 






395 






400 


His 


Gin 


Phe 


He Pro Ala 


Leu 


Ser 




410 






415 




Gly 


Gin 


His 


He Lys Gin 


Leu 


Ser 


425 






430 






He 


Ala 


Pro 


Ala Glu Ala 


Pro 


Asp 








445 






Thr 


Gly 


Pro 


Pro Glu Ala 


Gin 


Phe 








460 






Lys 


He 


Lys 


Glu Glu Asn 


Phe 


Val 






475 






480 


Glu 


Ala 


His 


He Arg Val 


Pro 


Ser 




4 90 






495 




Lys 


Gly 


Gly Lys Thr Val 


Asn 


Glu 


505 






510 






Val 


Val 


Val 


Pro Arg Asp 


Gin 


Thr 








525 






Val 


Lys 


He Thr Gly His 


Phe 


Tyr 








540 






He 


Gin 


Glu 


He Leu Thr 


Gin 


Val 






555 






560 


Leu 


Gin 


Ser Gly Pro Pro 


Gin 


Ser 




570 






575 





<210> 177 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<400> 177 

atgccccgta aatgtcttca gtgttcttca gggtagttgg gatctcaaaa gatttggttc 60 
agatccaaac aaatacacat tctgtgtttt agctcagtgt tttctaaaaa aagaaactgc 12 0 
cacacagcaa aaaattgttt actttgttgg acaaaccaaa tcagttctca aaaaatgacc 18 0 
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ggtgcttata aaaagttata aatatcgagt 
gaagtgagct tgtgcttagt atttacattg 
gcaaactggt gcagaaattc tataaactct 
attttgtttt gttttgtaaa aatgataaaa 

<210> 178 
<211> 561 
<212> DNA 

<213> Homo sapiens 



agctctaaaa caaaccacct gaccaagagg 24 0 
gatgccagtt ttgtaatcac tgacttatgt 300 
ttgctgtttt tgatacctgc tttttgtttc 360 
cttcagaaaa t 401 



<400> 178 

acgcctttca agggtgtacg caaagcactc 
gcccgctatg ggacaggggt ctttggccag 
agtgagctgg- ccactgcggt taaagcacga 
gcagccaaag acctaactca gtcccctgag 
ctcccctcca gtcagaagag taaacgtgcc 
gataactata acacattgga gagtactctg 
taagccagtc agttgcaatg tgcaagacag 
ggcccagcag gcccagactg tatccatcca 
ttgtgtctaa agggtaattc cccaaccctt 
gactattttc ccccagtagc g 

<210> 179 
<211> 521 
<212> DNA 

<213> Homo sapiens 



attgataccc ttttggatgg ctatgaaaca 60 
aatgagtacc tacgctatca ggaggccctg 12 0 
attgggagct ctcagcgaca tcaccagtca 18 0 
gtctccccaa caaccatcca ggtgacatac 24 0 
aagcacttcc ttgaattgaa gagctttaag 300 
tgacggagct gaaggactct tgccgtagat 3 60 
gctgcttgcc gggccgccct cggaacatct 42 0 
agttcccgtt gtatccagag ttcttagagc 480 
ccttatgagc atttttagaa cattggctaa 540 
561 



<400> 179 

cccaacgcgt ttgcaaatat tcccctggta 
gatcgagcaa tggcttcagg acatgggttc 
gcatgaagac tggcttgtct cagtgtttca 
ctcgctccct gttagtgccg tatgacagcc 
ttctctgtgg tcaaggttgg ttggctgatt 
acgtgagcag tcagcaccag ttctgcacca 
tttctcctgg ccctgggtgg gctagggcct 
aggataagtg ggatctacca attgattctg 
atgtgggaaa cagatctaaa tctcatttta 



gcctacttcc ttacccccga atattggtaa 60 
tcttctcctg tgatcattca agtgctcact 12 0 
acctcaccag ggctgtctct tggtccacac 180 
cccatcaaat gaccttggcc aagtcacggt 24 0 
ggtggaaagt agggtggacc aaaggaggcc 300 
gcagcgcctc cgtcctagtg ggtgttcctg 360 
gattcgggaa gatgcctttg cagggagggg 420 
gcaaaacaat ttctaagatt tttttgcttt 480 
tgctgtattt t 521 



<210> 180 
<211> 417 
<212> DNA 

<213> Homo sapiens 



<400> 180 

ggtggaattc gccgaagatg gcggaggtgc 
tcctgggccg cctggcggcc atcgtggcta 
tcgtacgctg tgaaggcatc aacatttctg 
tggctttcct ccgcaagcgg atgaacacca 
cccccagccg catcttctgg cggaccgtgc 
gccaggccgc tctggaccgt ctcaaggtgt 
aaaagcggat ggtggttcct gctgccctca 

<210> 181 
<211> 283 
<212> DNA 

<213> Homo sapiens 



aggtcctggt gcttgatggt cgaggccatc 60 
aacaggtact gctgggccgg aaggtggtgg 12 0 
gcaatttcta cagaaacaag ttgaagtacc 18 0 
acccttcccg aggcccctac cacttccggg 24 0 
gaggtatgct gccccacaaa accaagcgag 300 
ttgacggcat cccaccgccc tacgacaaga 360 
aggtcgtgcg tctgaagcct acaagaa 417 



<220> 



WO 02/47534 



94 



PCT7US01/47576 



<221> misc_feature 
<222> 35 

<223> n = A,T,C or G 



<400> 181 

gatttcttct aaataggatg taaaacttct 
caagaactca agtgtaactg tgataaaata 
tgtaatctca gaatacacag gtgacataga 
atttacattg tttacacttc tatgaccagg 
caagtagtgt cttcctacct atctccagat 



ttcanattac tcttcctcag tcctgcctgc 60 
acctttccca ggtatattgg caggtatgtg 120 
tatgatatga caactggtaa tggtggattc 180 
ccttaaggga aggtcagttt tttaaaaaac 240 
acatgtcaaa aaa 283 



<210> 182 






<211> 401 






<212> DNA 






<213> Homo 


sapiens 




<400> 182 






atattcttgc 


tgcttatgca 


gctgacattg 


tatttcccac 


agtgaaagaa 


aacgctggcc 


agaggattga 


gtaagtagtt 


ggatggcttt 


atgctttaag 


aaacatttgt 


tatacattcc 


tagcaggcag 


tgtgttttcc 


ttccatgtct 


gctgcaagtc 


tgtcctatct 


gaattcccag 


ctagcagata 


aaactatggg 


gaaaacttaa 


<210> 183 






<211> 366 






<212> DNA 






<213> Homo 


sapiens 




<220> 






<221> misc 


feature 




<222> 325 






<223> n = A,T,C or G 





ttgccctccc taaagcaacc aagtagcctt 60 
tatcagttac attacaaaag gcagatttca 12 0 
cataaaaaca agaattcaag aagaggattc 180 
tcacaaatta tacctgggat aaaaactatg 240 
ctctgcacta cctgcagtgt gtcctctgag 300 
cagaagcact aagaagctcc accctatcac 360 
atctgtgcat a 4 01 



<400> 183 

accgtgtcca agtttttaga acccttgtta 
accatcatgc tttgatgttc ccctgtcttt 
tttaaggaca aagatgaagt cactgtaaac 
tttttcagtg cagaaattaa aagtaagtat 
gtgtcggaat cactggtaaa tgttggctga 
cactttgagc gctttaagag attancctga 
aaaaaa 



gccagaccga ggtgtcctgg tcaccgtttc 60 
ctctcttctg ctctcaagag caaaggttaa 120 
taatctgtca ttgtttttac cttccttttc 180 
aaagcaccgt gattgggagt gtttttgcgt 24 0 
gaacaatccc tccccttgca cttgtgaaaa 300 
gaaataatta aatatctttt ctcttcaaaa 3 60 
366 



<210> 184 
<211> 370 
<212> DNA 
<213> Homo sapiens 

<400> 184 

tcttacttca aaagaaaaat aaacataaaa 
tttaataatt gtactgagag aaactgctta 
taaaatgtta gtctacatag atgggtgatt 
ttgcattcat gcttctgtgt acacataatg 
tcagtctgct ctgtttaatt ctgctgtctg 
cacagtttag tgatatctag gagtataaag 
ggtttaaaaa 



aataagttgc tggttcctaa caggaaaaat 60 

cgtacacatt gcagatcaaa tatttggagt 12 0 

gtaactttat tgccattaaa agatttcaaa 18 0 

aaaaatgggc aaataatgaa gatctctcct 240 

ctcttctcta atgctgcgtc cctaattgta 300 

ttgtcgccca tcaataaaaa tcacaaagtt 360 
370 
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<210> 185 
<211> 107 
<212> DNA 
<213> Homo sapie: 



<400> 185 

ctcatattat tttccttttg agaaattgga aactctttct gttgctatta tattaataaa 60 
gttggtgttt attttctggt agtcaccttc cccatttaaa aaaaaaa 107 

<210> 186 
<211> 309 
<212> DNA 



<400> 186 

gaaaggatgg ctctggttgc cacagagctg 
agagggccac aggggtggcc gggagttgtc 
gccagtgagt gacagtcatg agggagtgtc 
ttctgtctga atgaaaggcc aaggctacag 
tgcccacgta gtggaggcct ctggcagatc 
tttatggtt 

<210> 187 
<211> 477 
<212> DNA 
<213> Homo sapiens 



ggacttcatg ttcttctaga gagggccaca 60 
agctgatgcc tgctgagagg caggaattgt 12 0 
tcttcttggg gaggaaagaa ggtagagcct 18 0 
tacagggccc cgccccagcc agggtgttaa 24 0 
ctgcattcca aggtcactgg actgtacgtt 300 
309 



<400> 187 

ttcagtccta gcaagaagcg agaattctga 
tccaacctcg ggccagtgtc ttcaggcttt 
tggcctgcaa gccaggccat ccctgggcgc 
cggaggccac aagctcagcc tcaggcccag 
aaggtctagc taggcccaag acctagttac 
aaagttggga gcatggcaga cagggaaggg 
atgtcttcag aagcaagtca ggtttcatgt 
agcccagggc tgtagcacag gcttcacagt 

<210> 188 
<211> 220 
<212> DNA 
<213> Homo sapiens 



gatcctccag aaagtcgagc agcacccacc 60 
actggggacc tgcgagctgg cctaatgtgg 120 
cacagacgag ctccgagcca ggtcaggctt 18 0 
gcactgattg tggcagaggg gccactaccc 24 0 
ccagacagtg agaagcccct ggaaggcaga 300 
aaacattttc agggaaaaga catgtatcac 360 
aaccgagtgt cctcttgcgt gtccaaaagt 420 
gattttgtgt tcagccgtga gtcacac 477 



<400> 188 

taaatatggt agatattaat attcctctta 
ttaaataagt accctgtgag tatgagataa 
cagatgttca agaggaagtt gctattgcat 
ttttttgagc attattttgt atttgttgta 

<210> 189 
<211> 417 
<212> DNA 
<213> Homo sapiens 



gatgaccagt gattccaatt gtcccaagtt 60 
attagtgaca atcagaacaa gtttcagtat 12 0 
tgattttaat atttgtacat aaacactgat 180 
ctttaatacc 22 0 



<220> 

<221> misc_feature 

<222> 76, 77 

<223> n = A,T,C or G 
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<400> 189 

accatcttga cagaggatac atgctcccaa 
ccatcattaa gcatcnnttt caaaattata 
tatcattatt ctagtccttt gaatttgtaa 
atgcactttt ctccagcaca tcagatttca 
gcacttgcta gtactacaca ctttgtacaa 
agaaaagcct tcctttgttg gcccttaaac 
tctgacgata cctgtatgtt cttattgtgt 

<210> 190 
<211> 497 
<212> DNA 
<213> Homo sapiens 



aacgtttgtt accacactta aaaatcactg 60 
gccattcatg atttactttt tccagatgac 12 0 
ggggaaaaaa aacaaaaaca aaaacttacg 180 
aattgaaaat taaagacatg ctatggtaat 24 0 
caaaaaacag aggcaagaaa caacggaaag 300 
tgagtcaaga tctgaaatgt agagatgatc 360 
aaataaaatt gctggtatga aatgaca 417 



<400> 190 

gcactgcggc gctctcccgt cccgcggtgg 
aacgcaggag ctgtcattga ctggcccaca 
acggtccgca aggatgccta catgttctgg 
aacttctcag aactgcccct ggtcatgtgg 
ggatttggaa actttgagga aattgggccc 
acctggctcc aggctgccag tctcctattt 
tatgtgaatg gtagtggtgc ctatgccaag 
gttctcctga agaccttctt cagttgccac 
ttctcagagt cctatgg 

<210> 191 
<211> 175 
<212> DNA 
<213> Homo sapiens 



ttgctgctgc tgccgctgct gctgggcctg 60 
gaggagggca aggaagtatg ggattatgtg 12 0 
tggctctatt atgccaccaa ctcctgcaag 180 
cttcagggcg gtccaggcgg ttctagcact 240 
cttgacagtg atctcaaacc acggaaaacc 300 
gtggataatc ccgtgggcac tgggttcagt 360 
gacctggcta tggtggcttc agacatgatg 420 
aaagaattcc agacagttcc attctacatt 480 
497 



<400> 191 

atgttgaata ttttgcttat taactttgtt tattgtcttc tccctcgatt agaatattag 60 
ctacttgagt acaaggattt gagcctgtta cattcactgc tgaattttag gctcctggaa 120 
gatacccagc attcaataga gaccacacaa taaatatatg tcaaataaaa aaaaa 175 

<210> 192 
<211> 526 
<212> DNA 
<213> Homo sapiens 



<400> 192 

agtaaacatt attatttttt ttatatttgc 
aagaacagta ttgctgtaat tccttttctt 
attgaagaaa gagaaacttg tcaactcata 
ctatcactaa gtaatgtatc cttcagaatg 
tcacaaaatt aaagcaagaa gtccatagta 
tcagagtttc tgaggtcaaa ttttatcttt 
ttacttaatg tattttggtg tattttcctc 
aattcctctg atcactttga gaaacaaact 
ttttaaatat aaaaataaat attgttctga 

<210> 193 
<211> 553 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 



aaaggaaaca tatctaatcc ttcctataga 60 
ttcttcctca tttcctctgc cccttaaaag 120 
tccacgttat ctagcaaagt acataagaat 180 
tgttggttta ccagtgacac cccatattca 240 
atttatttgc taatagtgga tttttaatgc 300 
tcacttacaa gctctatgat cttaaataat 360 
aaattaatat tggtgttcaa gactatatct 420 
tttattaaat gtaaggcact tttctatgaa 4 80 
ttattactga aaaaaa 52 6 
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<222> 290, 300, 411, 441 
<223> n = A,T,C or G 

<400> 193 

tccattgtgg tggaattcgc tctctggtaa aggcgtgcag gtgttggccg cggcctctga 60 
gctgggatga gccgtgctcc cggtggaagc aagggagccc agccggagcc atggccagta 120 
cagtggtagc agttggactg accattgctg ctgcaggatt tgcaggccgt tacgttttgc 180 
aagccatgaa gcatatggag cctcaagtaa aacaagtttt tcaaagccta ccaaaatctg 240 
ccttcagtgg tggctattat agaggtgggt ttgaacccaa aatgacaaan cgggaagcan 300 
cattaatact aggtgtaagc cctactgcca ataaagggaa aataagagat gctcatcgac 360 
gaattatgct tttaaatcat cctgacaaag gaggatctcc ttatatagca nccaaaatca 420 
atgaagctaa agatttacta naaggtcaag ctaaaaaatg aagtaaatgt atgatgaatt 480 
ttaagttcgt attagtttat gtatatgagt actaagtttt tataataaaa tgcctcagag 54 0 
ctacaatttt aaa 553 

<210> 194 
<211> 320 
<212> DNA 
<213> Homo sapiens 

<400> 194 

cccttcccaa tccatcagta aagaccccat ctgccttgtc catgccgttt cccaacaggg 60 
atgtcacttg atatgagaat ctcaaatctc aatgccttat aagcattcct tcctgtgtcc 120 
attaagactc tgataattgt ctcccctcca taggaatttc tcccaggaaa gaaatatatc 180 
cccatctccg tttcatatca gaactaccgt ccccgatatt cccttcagag agattaaaga 240 
ccagaaaaaa gtgagcctct tcatctgcac ctgtaatagt ttcagttcct attttcttcc 300 
attgacccat atttatacct 320 

<210> 195 
<211> 320 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 203, 218 
<223> n = A,T,C or G 

<400> 195 

aagcatgacc tggggaaatg gtcagacctt gtattgtgtt tttggccttg aaagtagcaa 60 
gtgaccagaa tctgccatgg caacaggctt taaaaaagac ccttaaaaag acactgtctc 12 0 
aactgtggtg ttagcaccag ccagctctct gtacatttgc tagcttgtag ttttctaaga 18 0 
ctgagtaaac ttcttatttt tanaaagggg aggctggntt gtaactttcc ttgtacttaa 240 
ttgggtaaaa gtcttttcca caaaccacca tctattttgt gaactttgtt agtcatcttt 300 
tatttggtaa attatgaact 320 

<210> 196 
<211> 357 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 36 

<223> n = A,T,C or G 



<400> 196 

atataaaata atacgaaact ttaaaaagca ttggantgtc agtatgttga atcagtagtt 60 



WO 02/47534 



98 



PCT7US01/47576 



ctagtttctg tgtaagtgta 12 0 
gttatattca tagatttata 18 0 
ctaaccacta tgtacttttt 240 
aaattgttta gctctggcaa 300 
tatgactgtt aaaaaaa 357 

<210> 197 
<211> 565 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 27 

<223> n = A,T,C or G 



tcactttaac 
aatactacaa 
tgatgatatg 
tataaatact 
aaaaaaaaaa 



tgtaaacaat 
aaacttattt 
acatctggct 
gtatggacaa 
ttttaagagc 



ttcttaggac 
atactgttct 
aaaaagaaat 
aaaatggcat 
tggtactaat 



accatttggg 
tatgtcattt 
tattgcaaaa 
tttttatatt 
aaaggattat 



<400> 197 

tcagctgagt accatcagga tatttanccc 
aagcaacaat acttcctctt gacagctttg 
tggtcctaca ctttttagga tgcttggtga 
gttcctatat tttgggctat gtgggtagga 
agaaagtaag cccagggctt cagatctaag 
agttgtaatg ctaggcataa gcactctata 
gaatgtttct gaaacattaa acttgtattt 
aaatgtgtct catacatatg ctgtactagg 
atttgaatat atgaaagaat ttatacaaga 
atataatttg tacctattgt aaaaa 

<210> 198 
<211> 484 
<212> DNA 
<213> Homo sapiens 



tttaagtgct gttttgggag tagaaaacta 60 
attggaatgg ggttattaga tcattcacct 12 0 
acataacacc acttataatg aacatccctg 18 0 
attgttactt gttactgcag cagcagccct 240 
ttagtccaaa agctaaatga tttaaagtca 300 
atacattaaa ttataggccg agcaattagg 360 
atgtcactaa aattctaaca caaacttaaa 420 
cttcatcatg catttctaaa tttgtgtatg 480 
gtgttattta aaattattaa aaataaatgt 54 0 
565 



<400> 198 

tatgtaagta ttggtgtctg ctttaaaaaa 
acatttgaga acagtgttac tctgagcagt 
ctgttggatg tgtccattgt cgccagtttg 
tgggcgcagc agcaggtggc aggggtgtgg 
tctctggtgc tttctgagag ggtctctaaa 
agcacgtatt tctcccctct agtacctctg 
agggcagcag actcttgagt atactgcaga 
tccaggggct caactgacca agtaacacag 
aaac 



ggagacccag acttcacctg tcctttttaa 60 
tgggccacct tcaccttatc cgacagctga 120 
gctgttgccc ggacaggaca ggacctccat 180 
cttgaggtgg gtggcagcgt ctggtcctcc 240 
gcagagtgtg gttggcctgg gggaaggcag 300 
catttgtgag tgttccctct ggctttctga 360 
ggacatgctt tatcagtagg tcctgagggc 42 0 
aagttggggt atgtggccta tttgggtcgg 480 
484 



<210> 199 
<211> 429 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 77, 88, 134, 151, 189, 227, 274, 319 
<223> n = A,T,C or G 



<400> 199 

gcttatgttt tttgttttaa cttttgtttt 
tacagtacct ttctcanaca ttttgtanaa 
gaacattaaa aagngtgata gcgatattag 



ttaacattta gaatattaca ttttgtatta 60 
ttcatttcgg cagctcacta ggattttgct 12 0 
ngccaatcaa atggaaaaaa ggtagtctta 18 0 



WO 02/47534 



99 



PCT7US01/47576 



ataaacaana cacaacgttt ttatacaaca 
attgtttcct attaagtatt attctttggg 
caatttagca tttgctttng gtttttttct 
tatgtactgt atgggaaatg ttgtaaatat 
tgaatccaa 



tactttaaaa tattaanaaa actccttaat 24 0 
caanattttc tgatgctttt gattttctct 300 
ctatttagca ttctgttaag gcacaaaaac 360 
taccttttcc acattttaaa cagacaactt 42 0 
429 



<210> 200 
<211> 279 
<212> DNA 

<213> Homo sapiens 
<400> 200 

gcttttttga ggaattacag ggaagctcct ggaattgtac atggatatct ttatccctag 60 
ggggaaatca aggagctggg cacccctaat tctttatgga agtgtttaaa actattttaa 12 0 
ttttattaca agtattacta gagtagtggt tctactctaa gatttcaaaa gtgcatttaa 180 
aatcatacat gttcccgcct gcaaatatat tgttattttg gtggagaaaa aaatagtata 240 
ttctacataa aaaattaaag atattaacta agaaaaaaa 27 9 

<210> 201 
<211> 569 
<212> DNA 

<213> Homo sapiens 



<400> 201 

taggtcagta tttttagaaa ctcttaatag 
attgttaaag cacacacctg cacaagaagc 
cacaaaaaaa aattctcaaa aagcaaggac 
actggatcat aggaagctta taacaagaat 
gtatccagta acagtagatg ttcaaaatat 
tgtacaacct tgtggttatt actaagcaag 
aattaatgtt atttatacac tgccttccat 
aaatctgaaa tgctactcca atatcagaaa 
gattttaaga gtacagagaa tcatgcacat 
aataaaagtc aaagatgaac tctcaaaaa 

<210> 202 
<211> 501 
<212> DNA 

<213> Homo sapiens 



ctcatactct tgataccaaa agcagccctg 60 
agtgatggtt gcatttacat ttcctgggtg 12 0 
ttacgctttt tgcaaagcct ttgagaagtt 18 0 
ggaagattct taaataactc actttctttg 240 
gtagctgatt aataccagca ttgtgaacgc 300 
ttactactag cttctgaaaa gtagcttcat 360 
gacttttact ttgccctaag ctaatctcca 42 0 
aaaaggggga ggtggaatta tatttcctgt 480 
ctctgattag ttcatatatg tctagtgtgt 540 
569 



<400> 202 

attaataggc ttaataattg ttggcaagga 
tagcatctgg cagtggggcc aagaaaataa 
gagcaacatg attgagaacc agtgtatgtc 
tgtacctgtg tggtctaagc tggaatctgg 
aattcttgac aatgaaatga agctcaatgt 
atagcaccac ctatcagcac tgaaaactct 
gtgactgaca ttatgaaggc ctgtactgaa 
tttcttggca ggctcgttgt acctcttgga 
tggcatattt tggaattctg c 

<210> 203 
<211> 261 
<212> DNA 

<213> Homo sapiens 



tccttttgct ttctttggca tgcaagctcc 60 
ggtttatgca tgtatgatgg ttttcttctt 12 0 
aacaggtgca tttgagataa ctttaaatga 180 
tcaccttcca tccatgcaac aacttgttca 240 
gcatatggat tcaatcccac accatcgatc 30 0 
tttgcattaa gggatcattg caagagcagc 360 
gacagcaagc tgttagtaca gaccagatgc 42 0 
aaacctcaat gcaagatagt gtttcagtgc 480 
501 



<220> 

<221> misc_f eature 
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<222> 36, 96 

<223> n = A,T,C or G 



<400> 203 

gacaagctcc tggtcttgag atgtcttctc 
gataaaatga atgagttctg tcatgattca 
gttagctctt tgaatgttct tgaaatttta 
tatcattgta taaaagctgt tatgtgcaac 
aatacttaaa cactgaaaaa a 



gttaangaga tgggcctttt ggaggtaaag 60 
ctattntata acttgcatga cctttactgt 12 0 
gactttcttt gtaaacaaat gatatgtcct 18 0 
agtgtggaga ttccttgtct gatttaataa 24 0 
261 



<210> 204 
<211> 421 
<212> DNA 
<213> Homo sapiens 



<400> 204 

agcatctttt ctacaacgtt aaaattgcag 
caacaataac aataaatcct aagtgtaaat 
gcctgttttt tccctttttt ctcctgggaa 
gcctctttcc tcttctcatg cttgagcttc 
gctbgtgtgc ttggactcgg ctccaggtgg 
aactcaaacc ttcaagccct aggtgtagcc 
actggcatta acaaaaaaag aagataaaat 
a 



aagtagctta tcattaaaaa acaacaacaa 60 
cagttattct accccctacc aaggatatca 12 0 
taattgtggg cttcttccca aatttctaca 180 
cctgtttgca cgcatgcgtg tgcaggactg 240 
aagcatgctt tcccttgtta ctgttggaga 300 
attttgtcaa gtcatcaact gtatttttgt 360 
attgtaccat taaactttaa taaaacttta 42 0 
421 



<210> 205 
<211> 460 
<212> DNA 
<213> Homo sapiens 

<400> 205 

tactctcaca atgaaggacc tggaatgaaa 
tttagtgcaa atccagagcc agcgtcggtt 
ggaaaagctc tcaggagacc tcacctagat 
tgtcagccaa gagcctttta tttgaaagct 
gaggaagatg ggaaagaaag gacagatttt 
cagactttag aaaactacag gactccaaat 
gaatgagacc aaaggaaaag cttaacatac 
agagaatctt atgtttttta aatggagtta 

<210> 206 
<211> 481 
<212> DNA 
<213> Homo sapiens 



aatctgtgtc taaacaagtc ctctttagat 60 
gcctcgagta attctttcat gggtaccttt 120 
gcctattcaa gctttggaca gccatcagat 180 
cattcttccc cagacttgga ctctgggtca 240 
caggaagaaa atcacatttg tacctttaaa 300 
tttcagtctt atgacttgga cacatagact 360 
tacctcaagg tgaactttta tttaaaagag 42 0 
tgaattttaa 4 60 



<400> 206 

tgtggtggaa ttcgggacgc ccccagaccc 
tgcggaagca gtgacctctg acccctggtg 
gtcccgcggg acttggtttt ctcaagctct 
cgcctgccct gggtggatac ttgaacccca 
cggccttccc atctgcctgc ccacccggag 
acctcccgcc ctcagtcctg cggtgtgcgt 
cctggcctcc gcgcccgccc gcccacgcga 
ggtgtgaccc cctggaggtg ccctcggccc 



tgactttttc ctgcgtgggc cgtctcctcc 60 
accttcgctt tgagtgcctt ttgaacgctg 12 0 
gtctgtccaa agacgctccg gtcgaggtcc 180 
gacgcccctc tgtgctgctg tgtccggagg 240 
ctctttccgc cggcgcaggg tcccaagccc 300 
ctgggcacgt cctgcacaca caatgcaagt 360 
gccgtacccg ccgccaactc tgttatttat 42 0 
accggggcta tttattgttt aatttatttg 480 
481 



<210> 207 
<211> 605 
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<221> misc_feature 
<222> 20, 21, 61 
<223> n = A, T, C or G 



<400> 210 

cgccttgggg agccggcggn ngagtccggg 
nggcccgcgg gcccagggtg gggatgcacc 
agaagaaact tgcagaggcc aagtataagg 
tagcccagat gtcaaagcag ttggacatgt 
aacacaagca ggagatccgg aagaatcctg 
caaccattgg cgtggatccg ctggcctctg 
tgggggactt ctattacgaa ctaggtgtcc 
atcggaatgg aggtctgata actttggagg 
gcaagttcgc ccaggatgtc agtcaagatg 



acgtggagac ccggggtccc ggcagccggg 60 
gccgcggggt gggagctggc gccatcgcca 120 
agcgagggac ggtcttggct gaggaccagc 18 0 
tcaagaccaa cctggaggaa tttgccagca 240 
agttccgtgt gcagttccag gacatgtgtg 300 
gaaaaggatt ttggtctgag atgctgggcg 3 60 
aaattatcga agtgtgcctg gcgctgaagc 42 0 
aactacatca acaggtgttg aagggaaggg 480 
acctgatcag agccatcaag aaa 533 



<210> 211 
<211> 451 
<212> DNA 
<213> Homo sapiens 



<400> 211 

ttagcttgag ccgagaacga ggcgagaaag 
gtgaacgggg aggggaccgt ggggaccggc 
ggagcttcag caaggaagtg gaggagcgga 
tgcgcaaggc agctagcctc acggaggatc 
aagctgccct acccccagtg agccccctga 
agaaatccaa ggctatcatt gaggaatatc 
agtgcgtgca ggagctggcc tcaccctcct 
agtctacgct ggagcgcagt gccattgctc 

<210> 212 
<211> 471 
<212> DNA 
<213> Homo sapiens 



ctggagaccg aggagaccgc ctagagcgga 60 
ttgatcgtgc gcggacacct gctaccaagc 120 
gtagagaacg gccctcccag cctgaggggc 180 
gggaccgtgg gcgggatgcc gtgaagcgag 24 0 
aggcggctct ctctgaggag gagttagaga 300 
tccatctcaa tgacatgaaa gaggcagtcc 360 
tgctcttcat ctttgtacgg catggtgtcg 420 
g 451 



<220> 

<221> misc_feature 
<222> 54 

<223> n = A,T,C or G 
<400> 212 

gtgattattc ttgatcaggg agaagatcat 
gggcaacatt ccacagctgc cctggctgtg 
gcactggggt gggggcggaa ttggggttac 
gagatccagt gcagttgtga tttctgtgga 
ttggcttaaa tccagttttc aatcttcgac 
aacctgtctg acccggtcac gttcttggat 
gggtgggaac tcacgtgggg agcggtggct 
tccatgggac tttccttccc tctcctgctt 

<210> 213 
<211> 511 
<212> DNA 

<213> Homo sapiens 



ttagatttgt tttgcattcc ttanaatgga 60 
atgagtgtcc ttgcaggggc cggagtagga 120 
tcgatgtaag ggattccttg ttgttgtgtt 180 
tcccagcttg gttccaggaa ttttgtgtga 240 
agctgggctg gaacgtgaac tcagtagctg 300 
cctcagaact ctttgctctt gtcggggtgg 3 60 
gagaaaatgt aaggattctg gaatacatat 42 0 
cctcttttcc tgctccctaa c 471 



<220> 

<221> misc_feature 
<222> 27, 63, 337, 442 
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<212> DNA 

<213> Homo sapiens 



<400> 207 

accctttttg gattcagggc tcctcacaat 
tatagaagca tccctttgta tactgttttg 
ctcactggat tctcacggta ggatttctga 
ttttttgatc ctagggtgct ccttttgttt 
ggtggcagaa ttggcaccat tacccaggtc 
tgtatcatga aatgatttga aatcattgta 
tttccttgtg ctttgataac aaagactcca 
aagggctaga ttgggatttg aagacaaaat 
aacattaatg aaagcaaaac attataaaag 
tcttgatgct tccaaatgac atctaccaga 
cataa 



taaaatgagt gtaatgaaac aaggtgaaaa 60 
ctacttacag tgtacttggc attgctttat 120 
gatcttaatc taagctccaa agttgtctac 18 0 
tacagagcag ggtcacttga tttgctagct 24 0 
tgactgacca ccagtcagag gcactttatt 30 0 
aagcagcgaa gtctgataat gaatgccagc 360 
aatattctgg agaacctgga taaaagtttg 42 0 
tgtaggaaat cttacatttt tgcaataaca 48 0 
taattttaat tcaccacata cttatcaatt 54 0 
tatggttttg tggacatctt tttctgttta 600 
605 



<210> 208 
<211> 655 
<212> DNA 
<213> Homo , 



<400> 208 

ggcgttgttc tggattcccg tcgtaactta 
tgatgtcctg caaatgaagg aggaggatgt 
aggtggcacc aatcttgact tccagatgga 
catctatatc ataaatctca agaggacctg 
tgttgccatt gaaaaccctg ctgatgtcag 
ggctgtgctg aagtttgctg ctgccactgg 
tggaaccttc actaaccaga tccaggcagc 
tgaccccagg gctgaccacc agcctctcac 
tgcgctgtgt aacacagatt ctcctctgcg 
caagggagct cactcagtgg gtttgatgtg 
gcgtggcacc atttcccgtg aacacccatg 

<210> 209 
<211> 621 
<212> DNA 
<213> Homo sapiens 



aagggaaact ttcacaatgt ccggagccct 60 
ccttaagttc cttgcagcag gaacccactt 12 0 
acagtacatc tataaaagga aaagtgatgg 180 
ggagaagctt ctgctggcag ctcgtgcaat 240 
tgttatatcc tccaggaata ctggccagag 30 0 
agccactcca attgctggcc gcttcactcc 360 
cttccgggag ccacggcttc ttgtggttac 42 0 
ggaggcatct tatgttaacc tacctaccat 480 
ctatgtggac attgccatcc catgcaacaa 54 0 
gtggatgctg gctcgggaag ttctgcgcat 600 
ggaggtcatg cctgatctgt acttc 655 



<400> 209 

catttagaac atggttatca tccaagacta 
caaatccaca ttcctcttga gttctgcagc 
gccgtagaat cacatgatct gaggaccatt 
gagtcttcca taaagttttg catggagcaa 
tcagccctct aaaagcatag ggcttagcct 
tagttttgta aacactatag catctgttaa 
gccgtgactc tggactatat cagtttttgg 
ccacgtggac cagtctgaat gtctttcctt 
aagaaacaat ctaaacaagt ttctgttgca 
gtaggcttct atattgcatt taacttgttt 
ctattgatga ataaagaaat t 

<210> 210 
<211> 533 
<212> DNA 
<213> Homo sapiens 



ctctaccctg caacattgaa ctcccaagag 60 
ttctgtgtaa atagggcagc tgtcgtctat 12 0 
catggaagct gctaaatagc ctagtctggg 180 
acaaacagga ttaaactagg tttggttcct 240 
gcaggcttcc ttgggctttc tctgtgtgtg 300 
gatccagtgt ccatggaaac cttcccacat 360 
aaagcagggt tcctctgcct gctaacaagc 42 0 
tacacctatg tttttaaata gtcaaacttc 48 0 
tatgtgtttg tgaacttgta tttgtattta 540 
ttgtaactcc tgattcttcc ttttcggata 600 
621 



<220> 
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<223> n = A,T,C or G 



<400> 213 

otaattagaa acttgctgta ctttttnttt 
ctnccatttg cctacaataa attattgcag 
actttatatt tttccttttg ataaagggat 
atctcagccg tttccctgct ttcccttctg 
ctcttttaat cttaaagttc tacatttcat 
taactcttcc cactgcatat ttccatcttg 
ttgagataca gctatttaat atttctggga 
aaggttgttt tgcgtaactg anactccttg 
gccatggccg tgggagtact gggagtaaaa 

<210> 214 
<211> 521 
<212> DNA 
<213> Homo sapiens 



tcttttaggg gtcaaggacc ctctttatag 60 
cagtttgcaa tactaaaata ttttttatag 12 0 
gctgcatagt agagttggtg taattaaact 180 
ctccatatgc ctcattgtcc ttccagggag 24 0 
gctcttagtc aaattctgtt acctttttaa 300 
aattggnggt tctaaattct gaaactgtag 360 
gatgtgcatc cctcttcttt gtggttgccc 42 0 
atatgcttca gagaatttag gcaaacactg 48 0 
t 511 



<400> 214 

agcattgcca aataatccct aattttccac taaaaatata atgaaatgat gttaagcttt 60 
ttgaaaagtt taggttaaac ctactgttgt tagattaatg tatttgttgc ttccctttat 12 0 
ctggaatgtg gcattagctt ttttatttta accctcttta attcttattc aattccatga 180 
cttaaggttg gagagctaaa cactgggatt tttggataac agactgacag ttttgcataa 240 
ttataatcgg cattgtacat agaaaggata tggctacctt ttgttaaatc tgcactttct 300 
aaatatcaaa aaagggaaat gaagtataaa tcaatttttg tataatctgt ttgaaacatg 3 60 
agttttattt gcttaatatt agggctttgc cccttttctg taagtctctt gggatcctgt 42 0 
gtagaagctg ttctcattaa acaccaaaca gttaagtcca ttctctggta ctagctacaa 480 
attcggtttc atattctact taacaattta aataaactga a 521 

<210> 215 
<211> 381 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 17, 20, 60, 61, 365 

<223> n = A,T,C or G 



<400> 215 

gagcggagag cggaccngtn agagccctga gcagccccac cgccgccgcc ggcctagttn 60 
ncatcacacc ccgggaggag ccgcagctgc cgcagccggc cccagtcacc atcaccgcaa 12 0 
ccatgagcag cgaggccgag acccagcagc cgcccgccgc cccccccgcc gcccccgccc 180 
tcagcgccgc cgacaccaag cccggcacta cgggcagcgg cgcagggagc ggtggcccgg 240 
gcggcctcac atcggcggcg cctgccggcg gggacaagaa ggtcatcgca acgaaggttt 300 
tgggaacagt aaaatggttc aatgtaagga acggatatgg tttcatcaac aggaatgaca 360 
ccaangaaga tgtatttgta c 381 



<210> 216 
<211> 425 
<212> DNA 

<213> Homo sapiens 



<400> 216 

ttactaacta ggtcattcaa 
gatggtgttg aaatgtccac 
aacaggccaa tcctgaaggt 
gcataagagt cctatttgcc 



ggaagtcaag ttaacttaaa 
cttcttaaat ttttaagatg 
actccctgtt tgctgcagaa 
ccagttaatt caacttttgt 



catgtcacct aaatgcactt 60 
aacttagttc taaagaagat 12 0 
tgtcagatat tttggatgtt 180 
ctgcctgttt tgtggactgg 24 0 
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ctggctctgt tagaactctg 
aattgacaat atatatgcat 
cataatagta tttattaaag 
tttag 



tccaaaaagt gcatggaata 
gtgtttaaac caaatccaga 
aatcacaact gtaaacatga 



taacttgtaa agcttcccac 300 
aagcttaaac aatagagctg 3 60 
gaataactta aggattctag 42 0 
425 



<210>- 217 
<211> 181 
<212> DNA 
<213> Homo sapiens 



<400> 217 

gagaaaccaa atgataggtt gtagagcctg atgactccaa acaaagccat cacccgcatt 60 
cttcctcctt cttctggtgc tacagctcca agggcccttc accttcatgt ctgaaatgga 12 0 
actttggctt tttcagtgga agaatatgtt gaaggtttca ttttgttcta gaaaaaaaaa 180 
a 181 



<210> 218 






<211> 405 






<212> DNA 






<213> Homo 


sapiens 




<400> 218 






caggccttcc 


agttcactga 


caaacatggg 


agtgatacca 


tcaagcctga 


tgtccaaaag 


gcgctgggct 


gttttagtgc 


caggctgcgg 


tatttttttt 


ttccattagt 


aaaacacaag 


acaaggcagg 


cctttcctac 


agggggtgga 


ggcctgagtt 


ggcgttgtgg 


gcaggctact 


attaatcttt 


tgtagtttgt 


attaaacttg 


<210> 219 






<211> 216 






<212> DNA 






<213> Homo 


sapiens 




<220> 






<221> misc 


feature 




<222> 207," 


210 




<223> n = A,T,C or G 





gaagtgtgcc cagctggctg gaaacctggc 60 
agcaaagaat atttctccaa gcagaagtga 120 
tgggcagcca tgagaacaaa acctcttctg 180 
acttcagatt cagccgaatt gtggtgtctt 240 
gagaccagcc tttcttcctt tggtaggaat 300 
ggtttgtatg atgtattagt agagcaaccc 360 
aactgagaaa aaaaa 4 05 



<400> 219 

actccaagag ttagggcagc agagtggagc gatttagaaa gaacatttta aaacaatcag 60 
ttaatttacc atgtaaaatt gctgtaaatg ataatgtgta cagattttct gttcaaatat 12 0 
tcaattgtaa acttcttgtt aagactgtta cgtttctatt gcttttgtat gggatattgc 180 
aaaaataaaa aggaaagaac cctcttnaan aaaaaa 216 

<210> 220 
<211> 380 
<212> DNA 
<213> Homo sapiens 



<400> 220 

cttacaaatt gcccccatgt gtaggggaca cagaaccctt tgagaaaact tagatttttg 60 
tctgtacaaa gtctttgcct ttttccttct tcattttttt ccagtacatt aaatttgtca 12 0 
atttcatctt tgagggaaac tgattagatg ggttgtgttt gtgttctgat ggagaaaaca 180 
gcaccccaag gactcagaag atgattttaa cagttcagaa cagatgtgtg caatattggt 24 0 
gcatgtaata atgttgagtg gcagtcaaaa gtcatgattt ttatcttagt tcttcattac 300 
tgcattgaaa aggaaaacct gtctgagaaa atgcctgaca gtttaattta aaactatggt 360 
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gtaagtcttt gacaaaaaaa 38 0 

<210> 221 
<211> 398 
<212> DNA 
<213> Homo sapiens 

<400> 221 

ggttagtaag ctgtcgactt tgtaaaaaag ttaaaaatga aaaaaaaagg aaaaatgaat 60 
tgtatattta atgaatgaac atgtacaatt tgccactggg aggaggttcc tttttgttgg 12 0 
gtgagtctgc aagtgaattt cactgatgtt gatattcatt gtgtgtagtt ttatttcggt 180 
cccagccccg tttcctttta ttttggagct aatgccagct gcgtgtctag ttttgagtgc 240 
agtaaaatag aatcagcaaa tcactcttat ttttcatcct tttccggtat tttttgggtt 300 
gtttctgtgg gagcagtgta caccaactct tcctgtatat tgcctttttg ctggaaaatg 360 
ttgtatgttg aataaaattt tctataaaaa ttaaaaaa 398 

<210> 222 
<211> 301 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 49, 64 

<223> n = A,T,C or G 

<400> 222 

ttcgataatt gatctcatgg gctttccctg gaggaaaggt tttttttgnt gtttattttt 60 
taanaacttg aaacttgtaa actgagatgt ctgtagcttt tttgcccatc tgtagtgtat 12 0 
gtgaagattt caaaacctga gagcactttt tctttgttta gaattatgag aaaggcacta 18 0 
gatgacttta ggatttgcat ttttcccttt attgcctcat ttcttgtgac gccttgttgg 240 
ggagggaaat ctgtttattt tttcctacaa ataaaaagct aagattctat atcgcaaaaa 300 
a 301 

<210> 223 
<211> 200 
<212> DNA 

<213> Homo sapiens 
<400> 223 

gtaagtgctt aggaagaaac tttgcaaaca tttaatgagg atacactgtt catttttaaa 60 
attccttcac actgtaattt aatgtgtttt atattctttt gtagtaaaac aacataactc 120 
agatttctac aggagacagt ggttttattt ggattgtctt ctgtaatagg tttcaataaa 18 0 
gctggatgaa cttaaaaaaa 200 

<210> 224 
<211> 385 
<212> DNA 
<213> Homo sapiens 

<400> 224 

gaaaggtttg atccggactc aaagaaagca aaggagtgtg agccgccatc tgctggagca 60 

gctgtaactg caagacctgg acaagagatt cgtcagcgaa ctgcagctca aagaaacctt 120 

tctccaacac cagcaagccc taaccagggc cctcctccac aagttccagt atctcctgga 18 0 

ccaccaaagg acagttctgc ccctggtgga cccccagaaa ggactgttac tccagcccta 24 0 

tcatcaaatg tgttaccaag acatcttgga tcccctgcta cttcagtgcc tggaatgggt 300 

aaacagagca cttaatgtta tttacagttt atattgtttt ctctggttac caataaaacg 360 

ggccattttc aggtggtaaa aaaaa 385 
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<210> 225 
<211> 560 
<212> PRT 

<213> Homo sapiens 



<400> 225 



Met 


Glu 


Cys 


Leu 


Tyr 


Tyr 


Phe Leu 


Gly Phe 


Leu 


Leu 


Leu 


Ala 


Ala 


Arg 


1 








5 






10 










15 




Leu 


Pro 


Leu 


Asp 


Ala 


Ala Lys Arg 


Phe His 


Asp 


Val 


Leu 


Gly 


Asn 


Glu 








20 








25 








30 






Arg 


Pro 


Ser 


Ala 


Tyr 


Met Arg Glu His Asn 


Gin 


Leu 


Asn 


Gly 


Trp 


Ser 






35 








40 








45 








Ser 


Asp 


Glu 


Asn 


Asp 


Trp 


Asn Glu 


Lys Leu 


Tyr 


Pro 


Val 


Trp 


Lys 


Arg 




50 










55 






60 










Gly 


Asp 


Met 


Arg 


Trp 


Lys 


Asn Ser 


Trp Lys 


Gly 


Gly 


Arg 


Val 


Gin 


Ala 


65 










70 






75 










80 


Val 


Leu 


Thr 


Ser 


Asp 


Ser 


Pro Ala 


Leu Val 


Gly 


Ser 


Asn 


He 


Thr 


Phe 










85 






90 










95 




Ala 


Val 


Asn 


Leu 


He 


Phe 


Pro Arg 


Cys Gin 


Lys 


Glu 


Asp 


Ala 


Asn 


Gly 








100 








105 








110 






Asn 


He 


Val 


Tyr 


Glu 


Lys 


Asn Cys 


Arg Asn 


Glu 


Ala 


Gly 


Leu 


Ser 


Ala 






115 








120 








125 








Asp 


Pro 


Tyr 


Val 


Tyr 


Asn 


Trp Thr Ala Trp 


Ser 


Glu 


Asp 


Ser 


Asp 


Gly 




130 










135 






14 0 










Glu 


Asn 


Gly 


Thr 


Gly 


Gin 


Ser His 


His Asn 


Val 


Phe 


Pro 


Asp 


Gly 


Lys 


145 










150 






155 










160 


Pro 


Phe 


Pro 


His 


His 


Pro 


Gly Trp Arg Arg 


Trp 


Asn 


Phe 


He 


Tyr 


Val 










165 






170 










175 




Phe 


His 


Thr 


Leu 


Gly 


Gin 


Tyr Phe 


Gin Lys 


Leu 


Gly 


Arg 


Cys 


Ser 


Val 








180 








185 








190 






Arg 


Val 


Ser 


Val 


Asn 


Thr 


Ala Asn 


Val Thr 


Leu 


Gly 


Pro 


Gin 


Leu 


Met 






195 








200 








205 








Glu 


Val 


Thr 


Val 


Tyr 


Arg 


Arg His 


Gly Arg 


Ala 


Tyr 


Val 


Pro 


He 


Ala 




210 










215 






220 










Gin 


Val 


Lys 


Asp 


Val 


Tyr 


Val Val 


Thr Asp 


Gin 


He 


Pro 


Val 


Phe 


Val 


225 










230 






235 










240 


Thr 


Met 


Phe 


Gin 


Lys 


Asn 


Asp Arg 


Asn Ser 


Ser 


Asp 


Glu 


Thr 


Phe 


Leu 










245 






250 










255 




Lys 


Asp 


Leu 


Pro 


He 


Met 


Phe Asp 


Val Leu 


He 


His 


Asp 


Pro 


Ser 


His 








260 








265 








270 






Phe 


Leu 


Asn 


Tyr 


Ser 


Thr 


He Asn 


Tyr Lys 


Trp 


Ser 


Phe 


Gly 


Asp 


Asn 






275 








280 








285 








Thr 


Gly 


Leu 


Phe 


Val 


Ser 


Thr Asn 


His Thr 


Val 


Asn 


His 


Thr 


Tyr 


Val 




290 










295 






300 










Leu 


Asn 


Gly 


Thr 


Phe 


Ser 


Leu Asn 


Leu Thr 


Val 


Lys 


Ala 


Ala 


Ala 


Pro 


305 










310 






315 










320 


Gly 


Pro 


Cys 


Pro 


Pro 


Pro 


Pro Pro 


Pro Pro 


Arg 


Pro 


Ser 


Lys 


Pro 


Thr 










325 






330 










335 




Pro 


Ser 


Leu 


Gly 


Pro 


Ala 


Gly Asp 


Asn Pro 


Leu 


Glu 


Leu 


Ser 


Arg 


He 








340 








345 








350 






Pro 


Asp 


Glu 


Asn 


Cys 


Gin He Asn Arg Tyr 


Gly 


His 


Phe 


Gin 


Ala 


Thr 






355 








360 








365 








He 


Thr 


He 


Val 


Glu 


Gly 


He Leu 


Glu Val 


Asn 


He 


He 


Gin 


Met 


Thr 




370 










375 






380 










Asp 


Val 


Leu 


Met 


Pro 


Val 


Pro Trp 


Pro Glu 


Ser 


Ser 


Leu 


He 


Asp 


Phe 


385 










390 






395 










400 


Val 


Val 


Thr 


Cys 


Gin 


Gly Ser He 


Pro Thr 


Glu 


Val 


Cys 


Thr 


He 


He 
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405 






410 






415 




Ser 


Asp 


Pro Thr 


Cys 


Glu He 


Thr 


Gin Asn Thr Val 


Cys 


Ser 


Pro 


Val 






420 








425 




430 






Asp 


Val 


Asp Glu Met 


Cys Leu 


Leu 


Thr Val Arg Arg 


Thr 


Phe 


Asn 


Gly 






435 






440 




445 








Ser 


Gly 


Thr Tyr 


Cys 


Val Asn 


Leu 


Thr Leu Gly Asp 


Asp 


Thr 


Ser 


Leu 




450 






455 




460 










Ala 


Leu 


Thr Ser 


Thr 


Leu He 


Ser 


Val Pro Asp Arg 


Asp 


Pro 


Ala 


Ser 


465 








470 




475 








480 


Pro 


Leu 


Arg Met 


Ala 


Asn Ser 


Ala 


Leu He Ser Val 


Gly 


Cys 


Leu 


Ala 








485 






490 






495 




He 


Phe 


Val Thr 


Val 


He Ser 


Leu 


Leu Val Tyr Lys 


Lys 


His 


Lys 


Glu 






500 








505 




510 






Tyr 


Asn 


Pro He 


Glu 


Asn Ser 


Pro 


Gly Asn Val Val 


Arg 


Ser 


Lys 


Gly 






515 






520 




525 








Leu 


Ser 


Val Phe 


Leu 


Asn Arg Ala Lys Ala Val Phe 


Phe 


Pro 


Gly 


Asn 




530 






535 




540 










Gin 


Glu 


Lys Asp 


Pro 


Leu Leu Lys 


Asn Gin Glu Phe 


Lys 


Gly 


Val 


Ser 


545 








550 




555 








560 



<210> 226 
<211> 9 
<212> PRT 

<213> Homo sapiens 
<400> 226 

He Leu He Pro Ala Thr Trp Lys Ala 
1 5 



<210> 227 
<211> 9 
<212> PRT 

<213> Homo sapiens 
<400> 227 

Phe Leu Leu Asn Asp Asn Leu Thr Ala 
1 5 



<210> 228 
<211> 9 
<212> PRT 

<213> Homo sapiens 
<400> 228 

Leu Leu Gly Asn Cys Leu Pro Thr Val 
1 5 



<210> 229 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 229 

Lys Leu Leu Gly Asn Cys Leu Pro Thr Val 



WO 02/47534 



108 



PCT7US01/47576 



15 10 



<210> 230 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 230 

Arg Leu Thr Gly Gly Leu Lys Phe Phe Val 
15 10 



<210> 231 
<211> 9 
<212> PRT 

<213> Homo sapiens 
<400> 231 

Ser Leu Gin Ala Leu Lys Val Thr Val 
1 5 



<210> 232 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 232 

Ala Gly Ala Asp Val He Lys Asn Asp Gly He Tyr Ser Arg Tyr Phe 

15 10 15 

Phe Ser Phe Ala 
20 



<210> 233 
<211> 21 
<212> PRT 

<213> Homo sapiens 
<400> 233 

Phe Phe Ser Phe Ala Ala Asn Gly Arg Tyr Ser Leu Lys Val His Val 

15 10 15 

Asn His Ser Pro Ser 
20 



<210> 234 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 234 

Phe Leu Val Thr Trp Gin Ala Ser Gly Pro Pro Glu He He Leu Phe 

15 10 15 

Asp Pro Asp Gly 
20 
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<210> 235 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 235 

Leu Gin Ser Ala Val Ser Asn lie Ala Gin Ala Pro Leu Phe He Pro 

15 10 15 

Pro Asn Ser Asp 
20 



<210> 236 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 236 

He Gin Asp Asp Phe Asn Asn Ala He Leu Val Asn Thr Ser Lys Arg 

15 10 15 

Asn Pro Gin Gin' 
20 



<210> 237 
<211> 21 
<212> PRT 

<213> Homo sapiens 
<400> 237 

Arg Asn Ser Leu Gin Ser Ala Val Ser Asn He Ala Gin Ala Pro Leu 

15 10 15 

Phe He Pro Pro Asn 
20 



<210> 238 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 238 

Thr His Glu Ser His Arg He Tyr 

1 5 
Asn Ser Leu Gin 
20 



Val Ala He Arg Ala Met Asp Arg 
10 15 



<210> 239 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 239 

Arg Asn Pro Gin Gin Ala Gly He Arg Glu He Phe Thr Phe Ser Pro 

15 10 15 

Gin He Ser Thr 
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20 



<210> 240 
<211> 21 
<212> PRT 

<213> Homo sapiens 
<400> 240 

Gly Gin Ala Thr Ser Tyr Glu lie Arg Met Ser Lys Ser Leu Gin Asn 

1 5 10 15 

He Gin Asp Asp Phe 
20 



<210> 241 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 241 

Glu Arg Lys Trp Gly Phe Ser Arg Val Ser Ser Gly Gly Ser Phe Ser 

15 10 15 

Val Leu Gly Val 
20 



<210> 242 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 242 

Gly Ser His Ala Met Tyr Val Pro Gly Tyr Thr Ala Asn Gly Asn He 

15 10 15 

Gin Met Asn Ala 
20 



<210> 243 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 243 

Val Asn His Ser Pro Ser He Ser Thr Pro Ala His Ser He Pro Gly 

15 10 15 

Ser His Ala Met 
20 



<210> 244 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 244 

Ala Val Pro Pro Ala Thr Val Glu Ala Phe Val Glu Arg Asp Ser Leu 
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His Phe Pro His 



<210> 245 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 245 

Lys Pro Gly His Trp Thr Tyr Thr Leu Asn Asn Thr His His Ser Leu 

15 10 15 

Gin Ala Leu Lys 
20 



<210> 246 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 246 

Asn Leu Thr Phe Arg Thr Ala Ser Leu Trp lie Pro Gly Thr Ala Lys 

1 5 10 15 

Pro Gly His Trp 
20 



<210> 247 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 247 

Leu His Phe Pro His Pro Val Met He Tyr Ala Asn Val Lys Gin Gly 

15 10 15 

Phe Tyr Pro He 
20 



<210> 248 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 248 

Pro Glu Thr Gly Asp Pro Val Thr Leu Arg Leu Leu Asp Asp Gly Ala 

1 5 10 15 

Gly Ala Asp Val 
20 



<210> 249 
<211> 20 
<212> PRT 
<213> Homo 



sapiens 
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<400> 249 

Gly Phe Tyr Pro He Leu Asn Ala Thr Val Thr Ala Thr Val Glu Pro 

15 10 15 

Glu Thr Gly Asp 
20 



<210> 250 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 250 

Phe Asp Pro Asp Gly Arg Lys Tyr Tyr Thr Asn Asn Phe He Thr Asn 

15 10 15 

Leu Thr Phe Arg 
20 



<210> 251 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 251 

Leu Gin Ala Leu Lys Val Thr Val 

1 5 
Val Pro Pro Ala 

20 



Thr Ser Arg Ala Ser Asn Ser Ala 
10 15 



<210> 252 
<211> 153 
<212> PRT 

<213> Homo sapiens 
<400> 252 



Met 


Ala 


Ser 


Val 


Arg 


Val 


Ala Ala 


Tyr 


Phe Glu Asn Phe Leu Ala Ala 


1 








5 








10 15 


Trp 


Arg 


Pro 


Val 


Lys 


Ala 


Ser Asp Gly Asp Tyr Tyr Thr Leu Ala Val 








20 








25 


30 


Pro 


Met 


Gly 


Asp 


Val 


Pro Met Asp 


Gly He Ser Val Ala Asp He Gly 






35 








40 




45 


Ala 


Ala 


Val 


Ser 


Ser 


He 


Phe Asn 


Ser 


Pro Glu Glu Phe Leu Gly Lys 




50 










55 




60 


Ala 


Val 


Gly 


Leu 


Ser 


Ala 


Glu Ala 


Leu 


Thr He Gin Gin Tyr Ala Asp 


65 










70 






75 80 


Val 


leu 


Ser 


Lys 


Ala 


Leu 


Gly Lys 


Glu 


Val Arg Asp Ala Lys He Thr 










85 








90 95 


Pro 


Glu 


Ala 


Phe 


Glu 


Lys 


Leu Gly 


Phe 


Pro Ala Ala Lys Glu He Ala 








100 








105 


110 


Asn 


Met 


Cys 


Arg 


Phe 


Tyr 


Glu Met 


Lys 


Pro Asp Arg Asp Val Asn Leu 






115 








120 




125 


Thr 


His 


Gin 


Leu 


Asn 


Pro 


Lys Val 


Lys 


Ser Phe Ser Gin Phe He Ser 




130 










135 




140 


Glu 


Asn 


Gin 


Gly 


Ala 


Phe 


Lys Gly Met 





145 150 
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<210> 253 
<211> 462 
<212> DNA 
<213> Homo sapiens 



<400> 253 

atggccagtg tccgcgtggc ggcctacttt 
aaagcctctg atggagatta ctacaccttg 
ggtatctctg ttgctgatat tggagcagcc 
tttttaggca aggccgtggg gctcagtgca 
gttttgtcca aggctttggg gaaagaagtc 
gagaagctgg gattccctgc agcaaaggaa 
aagccagacc gagatgtcaa tctcacccac 
cagtttatct cagagaacca gggagccttc 

<210> 254 
<211> 8031 
<212> DNA 
<213> Homo sapiens 



gaaaactttc tcgcggcgtg gcggcccgtg 60 
gctgtaccga tgggagatgt accaatggat 12 0 
gtctctagca tttttaattc tccagaggaa 180 
gaagcactaa caatacagca atatgctgat 240 
cgagatgcaa agattacccc ggaagctttc 300 
atagccaata tgtgtcgttt ctatgaaatg 3 60 
caactaaatc ccaaagtcaa aagcttcagc 420 
aagggcatgt ag 4 62 



<400> 254 

tggcgaatgg gacgcgccct gtagcggcgc 
cagcgtgacc gctacacttg ccagcgccct 
ctttctcgcc acgttcgccg gctttccccg 
gttccgattt agtgctttac ggcacctcga 
acgtagtggg ccatcgccct gatagacggt 
ctttaatagt ggactcttgt tccaaactgg 
ttttgattta taagggattt tgccgatttc 
acaaaaattt aacgcgaatt ttaacaaaat 
tcggggaaat gtgcgcggaa cccctatttg 
tccgctcatg aattaattct tagaaaaact 
tcatatcagg attatcaata ccatattttt 
actcaccgag gcagttccat aggatggcaa 
gtccaacatc aatacaacct attaatttcc 
aatcaccatg agtgacgact gaatccggtg 
agacttgttc aacaggccag ccattacgct 
cgttattcat tcgtgattgc gcctgagcga 
aattacaaac aggaatcgaa tgcaaccggc 
tttcacctga atcaggatat tcttctaata 
tggtgagtaa ccatgcatca tcaggagtac 
taaattccgt cagccagttt agtctgacca 
ctttgccatg tttcagaaac aactctggcg 
tcgcacctga ttgcccgaca ttatcgcgag 
tgttggaatt taatcgcggc ctagagcaag 
cccttgtatt actgtttatg taagcagaca 
cgtgagtttt cgttccactg agcgtcagac 
gatccttttt ttctgcgcgt aatctgctgc 
gtggtttgtt tgccggatca agagctacca 
agagcgcaga taccaaatac tgtccttcta 
aactctgtag caccgcctac atacctcgct 
agtggcgata agtcgtgtct taccgggttg 
cagcggtcgg gctgaacggg gggttcgtgc 
accgaactga gatacctaca gcgtgagcta 
aaggcggaca ggtatccggt aagcggcagg 
ccagggggaa acgcctggta tctttatagt 
cgtcgatttt tgtgatgctc gtcagggggg 
gcctttttac ggttcctggc cttttgctgg 



attaagcgcg gcgggtgtgg tggttacgcg 60 
agcgcccgct cctttcgctt tcttcccttc 12 0 
tcaagctcta aatcgggggc tccctttagg 180 
ccccaaaaaa cttgattagg gtgatggttc 240 
ttttcgccct ttgacgttgg agtccacgtt 300 
aacaacactc aaccctatct cggtctattc 360 
ggcctattgg ttaaaaaatg agctgattta 420 
attaacgttt acaatttcag gtggcacttt 48 0 
tttatttttc taaatacatt caaatatgta 540 
catcgagcat caaatgaaac tgcaatttat 600 
gaaaaagccg tttctgtaat gaaggagaaa 660 
gatcctggta tcggtctgcg attccgactc 720 
cctcgtcaaa aataaggtta tcaagtgaga 78 0 
agaatggcaa aagtttatgc atttctttcc 840 
cgtcatcaaa atcactcgca tcaaccaaac 900 
gacgaaatac gcgatcgctg ttaaaaggac 960 
gcaggaacac tgccagcgca tcaacaatat 1020 
cctggaatgc tgttttcccg gggatcgcag 1080 
ggataaaatg cttgatggtc ggaagaggca 1140 
tctcatctgt aacatcattg gcaacgctac 1200 
catcgggctt cccatacaat cgatagattg 12 60 
cccatttata cccatataaa tcagcatcca 1320 
acgtttcccg ttgaatatgg ctcataacac 13 8 0 
gttttattgt tcatgaccaa aatcccttaa 1440 
cccgtagaaa agatcaaagg atcttcttga 1500 
ttgcaaacaa aaaaaccacc gctaccagcg 15 60 
actctttttc cgaaggtaac tggcttcagc 162 0 
gtgtagccgt agttaggcca ccacttcaag 1680 
ctgctaatcc tgttaccagt ggctgctgcc 17 40 
gactcaagac gatagttacc ggataaggcg 1800 
acacagccca gcttggagcg aacgacctac 18 60 
tgagaaagcg ccacgcttcc cgaagggaga 1920 
gtcggaacag gagagcgcac gagggagctt 1980 
cctgtcgggt ttcgccacct ctgacttgag 204 0 
cggagcctat ggaaaaacgc cagcaacgcg 2100 
ccttttgctc acatgttctt tcctgcgtta 2160 
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tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc 2220 
agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cctgatgcgg 22 80 
tattttctcc ttacgcatct gtgcggtatt tcacaccgca tatatggtgc actctcagta 2340 
caatctgctc tgatgccgca tagttaagcc agtatacact ccgctatcgc tacgtgactg 2400 
ggtcatggct gcgccccgac acccgccaac acccgctgac gcgccctgac gggcttgtct 2460 
gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca tgtgtcagag 2520 
gttttcaccg tcatcaccga aacgcgcgag gcagctgcgg taaagctcat cagcgtggtc 2580 
gtgaagcgat tcacagatgt ctgcctgttc atccgcgtcc agctcgttga gtttctccag 2640 
aagcgttaat gtctggcttc tgataaagcg ggccatgtta agggcggttt tttcctgttt 2700 
ggtcactgat gcctccgtgt aagggggatt tctgttcatg ggggtaatga taccgatgaa 27 60 
acgagagagg atgctcacga tacgggttac tgatgatgaa catgcccggt tactggaacg 2820 
ttgtgagggt aaacaactgg cggtatggat gcggcgggac cagagaaaaa tcactcaggg 2 8 80 
tcaatgccag cgcttcgtta atacagatgt aggtgttcca cagggtagcc agcagcatcc 2940 
tgcgatgcag atccggaaca taatggtgca gggcgctgac ttccgcgttt ccagacttta 3000 
cgaaacacgg aaaccgaaga ccattcatgt tgttgctcag gtcgcagacg ttttgcagca 3060 
gcagtcgctt cacgttcgct cgcgtatcgg tgattcattc tgctaaccag taaggcaacc 3120 
ccgccagcct agccgggtcc tcaacgacag gagcacgatc atgcgcaccc gtggggccgc 3180 
catgccggcg ataatggcct gcttctcgcc gaaacgtttg gtggcgggac cagtgacgaa 3240 
ggcttgagcg agggcgtgca agattccgaa taccgcaagc gacaggccga tcatcgtcgc 3300 
gctccagcga aagcggtcct cgccgaaaat gacccagagc gctgccggca cctgtcctac 3360 
gagttgcatg ataaagaaga cagtcataag tgcggcgacg atagtcatgc cccgcgccca 3420 
ccggaaggag ctgactgggt tgaaggctct caagggcatc ggtcgagatc ccggtgccta 3480 
atgagtgagc taacttacat taattgcgtt gcgctcactg cccgctttcc agtcgggaaa 3540 
cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat 3600 
tgggcgccag ggtggttttt cttttcacca gtgagacggg caacagctga ttgcccttca 3660 
ccgcctggcc ctgagagagt tgcagcaagc ggtccacgct ggtttgcccc agcaggcgaa 3720 
aatcctgttt gatggtggtt aacggcggga tataacatga gctgtcttcg gtatcgtcgt 37 8 0 
atcccactac cgagatatcc gcaccaacgc gcagcccgga ctcggtaatg gcgcgcattg 3840 
cgcccagcgc catctgatcg ttggcaacca gcatcgcagt gggaacgatg ccctcattca 3900 
gcatttgcat ggtttgttga aaaccggaca tggcactcca gtcgccttcc cgttccgcta 3960 
tcggctgaat ttgattgcga gtgagatatt tatgccagcc agccagacgc agacgcgccg 4020 
agacagaact taatgggccc gctaacagcg cgatttgctg gtgacccaat gcgaccagat 4080 
gctccacgcc cagtcgcgta ccgtcttcat gggagaaaat aatactgttg atgggtgtct 4140 
ggtcagagac atcaagaaat aacgccggaa cattagtgca ggcagcttcc acagcaatgg 4200 
catcctggtc atccagcgga tagttaatga tcagcccact gacgcgttgc gcgagaagat 4260 
tgtgcaccgc cgctttacag gcttcgacgc cgcttcgttc taccatcgac acca.ccacgc 4320 
tggcacccag ttgatcggcg cgagatttaa tcgccgcgac aatttgcgac ggcgcgtgca 4380 
gggccagact ggaggtggca acgccaatca gcaacgactg tttgcccgcc agttgttgtg 4440 
ccacgcggtt gggaatgtaa ttcagctccg ccatcgccgc ttccactttt tcccgcgttt 4500 
tcgcagaaac gtggctggcc tggttcacca cgcgggaaac ggtctgataa gagacaccgg 4560 
catactctgc gacatcgtat aacgttactg gtttcacatt caccaccctg aattgactct 4620 
cttccgggcg ctatcatgcc ataccgcgaa aggttttgcg ccattcgatg gtgtccggga 4680 
tctcgacgct ctcccttatg cgactcctgc attaggaagc agcccagtag taggttgagg 47 4 0 
ccgttgagca ccgccgccgc aaggaatggt gcatgcaagg agatggcgcc caacagtccc 4 8 00 
ccggccacgg ggcctgccac catacccacg ccgaaacaag cgctcatgag cccgaagtgg 4 8 60 
cgagcccgat cttccccatc ggtgatgtcg gcgatatagg cgccagcaac cgcacctgtg 4920 
gcgccggtga tgccggccac gatgcgtccg gcgtagagga tcgagatctc gatcccgcga 4980 
aattaatacg actcactata ggggaattgt gagcggataa caattcccct ctagaaataa 5040 
ttttgtttaa ctttaagaag gagatataca tatgcagcat caccaccatc accacggagt 5100 
acagcttcaa gacaatgggt ataatggatt gctcattgca attaatcctc aggtacctga 5160 
gaatcagaac ctcatctcaa acattaagga aatgataact gaagcttcat tttacctatt 5220 
taatgctacc aagagaagag tatttttcag aaatataaag attttaatac ctgccacatg 5280 
gaaagctaat aataacagca aaataaaaca agaatcatat gaaaaggcaa atgtcatagt 5340 
gactgactgg tatggggcac atggagatga tccatacacc ctacaataca gagggtgtgg 54 00 
aaaagaggga aaatacattc atttcacacc taatttccta ctgaatgata acttaacagc 54 60 
tggctacgga tcacgaggcc gagtgtttgt ccatgaatgg gcccacctcc gttggggtgt 5520 
gttcgatgag tataacaatg acaaaccttt ctacataaat gggcaaaatc aaattaaagt 558 0 . 
gacaaggtgt tcatctgaca tcacaggcat ttttgtgtgt gaaaaaggtc cttgccccca 5640 
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agaaaactgt attattagta agctttttaa agaaggatgc acctttatct acaatagcac 57 00 
ccaaaatgca actgcatcaa taatgttcat gcaaagttta tcttctgtgg ttgaattttg 57 60 
taatgcaagt acccacaacc aagaagcacc aaacctacag aaccagatgt gcagcctcag 5820 
aagtgcatgg gatgtaatca cagactctgc tgactttcac cacagctttc ccatgaacgg 5880 
gactgagctt ccacctcctc ccacattctc gcttgtagag gctggtgaca aagtggtctg 5940 
tttagtgctg gatgtgtcca gcaagatggc agaggctgac agactccttc aactacaaca 6000 
agccgcagaa ttttatttga tgcagattgt tgaaattcat accttcgtgg gcattgccag 6060 
tttcgacagc aaaggagaga tcagagccca gctacaccaa attaacagca atgatgatcg 6120 
aaagttgctg gtttcatatc tgcccaccac tgtatcagct aaaacagaca tcagcatttg 6180 
ttcagggctt aagaaaggat ttgaggtggt tgaaaaactg aatggaaaag cttatggctc 624 0 
tgtgatgata ttagtgacca gcggagatga taagcttctt ggcaattgct tacccactgt 63 00 
gctcagcagt ggttcaacaa ttcactccat tgccctgggt tcatctgcag ccccaaatct 63 60 
ggaggaatta tcacgtctta caggaggttt aaagttcttt gttccagata tatcaaactc 6420 
caatagcatg attgatgctt tcagtagaat ttcctctgga actggagaca ttttccagca 6480 
acatattcag cttgaaagta caggtgaaaa tgtcaaacct caccatcaat tgaaaaacac 6540 
agtgactgtg gataatactg tgggcaacga cactatgttt ctagttacgt ggcaggccag 6600 
tggtcctcct gagattatat tatttgatcc tgatggacga aaatactaca caaataattt 6660 
tatcaccaat ctaacttttc ggacagctag tctttggatt ccaggaacag ctaagcctgg 6720 
gcactggact tacaccctga acaataccca tcattctctg caagccctga aagtgacagt 6780 
gacctctcgc gcctccaact cagctgtgcc cccagccact 'gtggaagcct ttgtggaaag 6840 
agacagcctc cattttcctc atcctgtgat gatttatgcc aatgtgaaac agggatttta 6900 
tcccattctt aatgccactg tcactgccac agttgagcca gagactggag atcctgttac 6960 
gctgagactc cttgatgatg gagcaggtgc tgatgttata aaaaatgatg gaatttactc 7020 
gaggtatttt ttctcctttg ctgcaaatgg tagatatagc ttgaaagtgc atgtcaatca 7080 
ctctcccagc ataagcaccc cagcccactc tattccaggg agtcatgcta tgtatgtacc 7140 
aggttacaca gcaaacggta atattcagat gaatgctcca aggaaatcag taggcagaaa 7200 
tgaggaggag cgaaagtggg gctttagccg agtcagctca ggaggctcct tttcagtgct 7260 
gggagttcca gctggccccc accctgatgt gtttccacca tgcaaaatta ttgacctgga 7320 
agctgtaaaa gtagaagagg aattgaccct atcttggaca gcacctggag aagactttga 7380 
tcagggccag gctacaagct atgaaataag aatgagtaaa agtctacaga atatccaaga 7440 
tgactttaac aatgctattt tagtaaatac atcaaagcga aatcctcagc aagctggcat 7500 
cagggagata tttacgttct caccccaaat ttccacgaat ggacctgaac atcagccaaa 7560 
tggagaaaca catgaaagcc acagaattta tgttgcaata cgagcaatgg ataggaactc 7 620 
cttacagtct gctgtatcta acattgccca ggcgcctctg tttattcccc ccaattctga 7680 
tcctgtacct gccagagatt atcttatatt gaaaggagtt ttaacagcaa tgggtttgat 7740 
aggaatcatt tgccttatta tagttgtgac acatcatact ttaagcagga aaaagagagc 7800 
agacaagaaa gagaatggaa caaaattatt ataatgaatt ctgcagatat ccatcacact 78 60 
ggcggccgct cgagcaccac caccaccacc actgagatcc ggctgctaac aaagcccgaa 7920 
aggaagctga gttggctgct gccaccgctg agcaataact agcataaccc cttggggcct 7980 
ctaaacgggt cttgaggggt tttttgctga aaggaggaac tatatccgga t 8031 



<210> 255 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 9, 67, 247, 275, 277, 397 

<223> n = A,T,C or G 

<400> 255 

gtggccagng actagaaggc gaggcgccgc 
agtccanagg acggagaaga cgaggaagag 
ggaattattg attcagactt cctctcaaaa 
gacactgaga ggcccattct gcaagtggac 



gggaccatgg cggcggcggc ggacgagcgg 60 
gaggagcagt tggttctggt ggaattatca 120 
tgtgaaaata aatgcaaggt tttgggcatt 180 
agctgtgtct ttgctgggga gtatgaagac 240 
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actctangga cctgtgttat atttgaagaa aatgntnaac atgctgatac agaaggcaat 300 
aataaaacag tgctaaaata taaatgccat acaatgaaga agctcagcat gacaagaact 360 
ctcctgacag agaagaagga aggagaagaa aacatangtg g 401 

<210> 256 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 7, 37, 51, 79, 96, 98, 103, 104, 107, 116, 167, 181, 183, 
194, 206, 276, 303, 307, 308, 310, 323, 332, 341, 353, 374, 
376 

<223> n = A,T,C or G 



<400> 256 



tggtggncct 


gggatgggga 


accgcggtgg 


cttccgngga 


ggtttcggca 


ntggcatccg 


60 


gggccggggt 


cgcggccgng 


gacggggccg 


gggccnangc 


cgnnganctc 


gcggangcaa 


120 


ggccgaggat 


aaggagtgga 








aggacatgaa 


180 


nancaagccc 


ctgnaggaga 


tctatntctt 


cttccctgcc 


ccattaagga 


atcaagagat 


240 


catttgattt 


cttcctgggg 


gcctctctca 


aggatnaggt 


ttttgaagat 


tatgccagtg 300 


canaaannan 


accccgttgc 


ccngtccatc 


tncacccaac 


ncttccaagg 


gcnatttttg 360 


tttaggcctc 


attncngggg 


ggaaccttaa 


cccaatttgg 


g 




401 


<210> 257 














<211> 401 














<212> DNA 














<213> Homo 


sapiens 












<220>° 














<221> misc 


feature 












<222> 382," 


"387 












<223> n = A,T,C or G 












<400> 257 














atgtatgtaa 


aacacttcat 


aaaatgtaaa 


gggctataac 


aaatatgtta 


taaagtgatt 


60 


ctctcagccc 


tgaggtatac 


agaatcattt 


gcctcagact 


gctgttggat 


tttaaaattt 


120 


ttaaaatatc 


tgctaagtaa 


tttgctatgt 


cttctcccac 


actatcaata 


tgcctgcttc 


180 


taacaggctc 


cccactttct 


tttaatgtgc 


tgttatgagc 


tttggacatg 


agataaccgt 240 


gcctgttcag 


agtgtctaca 


gtaagagctg 


gacaaactct 


ggagggacac 


agtctttgag 


300 


acagctcttt 


tggttgcttt 


ccacttttct 


gaaaggttca 


cagtaacctt 


ctagataata 3 60 


gaaactccca 


gttaaagcct 


angctancaa 


ttttttttag 


t 




401 


<210> 258 














<211> 401 














<212> DNA 














<213> Homo 


sapiens 












<400> 258 














ggagcgctag 


gtcggtgtac 


gaccgagatt 


agggtgcgtg 


ccagctccgg 


gaggccgcgg 


60 


tgaggggccg 


ggcccaagct 


gccgacccga 


gccgatcgtc 


agggtcgcca 


gcgcctcagc 


120 


tctgtggagg 


agcagcagta 


gtcggagggt 


gcaggatatt 


agaaatggct 


actccccagt 


180 


caattttcat 


ctttgcaatc 


tgcattttaa 


tgataacaga 


attaattctg 


gcctcaaaaa 


240 


gctactatga 


tatcttaggt 


gtgccaaaat 


cggcatcaga 


gcgccaaatc 


aagaaggcct 


300 


ttcacaagtt 


ggccatgaag 


taccaccctg 


acaaaaataa 


gacccagatg 


ctgaagcaaa 


360 


attcagagag 


attgcagaag 


catatgaaac 


actctcagat 


g 




401 
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<210> 259 
<211> 401 
<212> DNA 

<213> Homo sapiens 
<400> 259 

attgggtttg gagggaggat 
ctccagaata ttgtgggttt 
acagctcagg ctcacagaag 
gtccgaaatg gcaagctgtg 
attagtgcct ctgtgcgcat 
gttcctattc accaactgga 
ctggtggccc ctttgatcat 

<210> 260 
<211> 363 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 7, 9, 19, 41,' 63, 73, 106, 111, 113, 116, 119, 156, 158, 
162, 187, 247, 288, 289, 290, 292, 298, 299, 300, 340 
<223> n = A,T,C or G 



gatgacagag gaatgccctt 
gatcatcaat gcagtcatgt 
ggcagaaact ttgattttca 
cttcatgttc cgagtgggtg 
ccaggtggtc aagaaaacaa 
cattcctgtt gataacccaa 
ctgccacgtg attgacaagc 



tggccatcac ggttttgatt 60 

taggctgcat tttcatgaaa 120 

gccgccatgc tgtgattgcc 180 

acctgaggaa aagcatgatc 24 0 

ctacacctga aggggaggtg 300 

tcgagagcaa taacattttt 360 

g 401 



<400> 260 

aggaganang gagggggana 
canggagagg aancagaaag 
caggtggggg ctggggtggg 
cgctggnctg ttgaaaccca 
cttattnctg gaatgcaagt 
attgctccct tatctgcttg 
aca 



tgaataggga tggagaggga 
gagaggcaag acagggagac 
gcatggagag cctttnangt 
ctccatggct tcctgccact 
ggctgtggct tggagcctcc 
gaatatctga gtttttccan 



natagtggat gagcagggca 60 
acacancaca nangangana 120 
cncccaggcc accctgctct 180 
gcagttgggc ccagggctgg 24 0 
cctctggnnn anggaaannn 300 
cccggaaata aaacacacac 360 
363 



<210> 261 
<211> 401 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 114, 152 
<223> n = A,T,C or G 



<400> 261 

cggctctccg ccgctctccc ggggtttcgg ggcacttggg tcccacagtc tggtcctgct 60 

tcaccttccc ctgacctgag tagtcgccat ggcacaggtt ctcagaggca ctgngactga 12 0 

cttccctgga tttgatgagc gggctgatgc anaaactctt cggaaggcta tgaaaggctt 18 0 

gggcacagat gaggagagca tcctgactct gttgacatcc cgaagtaatg ctcagcgcca 240 

ggaaatctct gcagctttta agactctgtt tggcagggat cttctggatg acctgaaatc 300 

agaactaact ggaaaatttg aaaaattaat tgtggctctg atgaaaccct ctcggcttta 360 

tgatgcttat gaactgaaac atgccttgaa gggagctgga a 4 01 

<210> 262 
<211> 401 
<212> DNA 
<213> Homo sapiens 
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<220> 

<221> misc_feature 

<222> 7, 26, 258, 305, 358, 373, 374, 378 
<223> n = A,T,C or G 



<400> 262 

agtctanaac atttctaata ttttgngctt 
tttttaaata ctgtaaagtg acatatagtt 
agtttataac atgaagaata ttgtaccatt 
ttcaaaagaa taatgataga ggtgaaaata 
tcaactcaaa aattatgntg catagtttta 
tccancttca atgagaaaat aaaatctaca 
tttttttgct aannagcnaa aaatataaac 

<210> 263 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 232, 290, 304, 326, 383 

<223> n = A,T,C or G 



tcatatatca aaggagatta tgtgaaacta 60 
ataagatata tttctgtaca gtagagaaag 12 0 
atacattttc attctcgatc tcataagaaa 18 0 
tgtttacttt ctctaaatca agcctagttg 240 
ttttgaattt aggttttggg actacttttt 300 
actcaggagt tactacagaa gttctaanta 360 
atatgaaaat g 4 01 



<400> 263 

ctgtccgacc aagagaggcc ggccgagccc gaggcttggg cttttgcttt ctggcggagg 60 
gatctgcggc ggtttaggag gcggcgctga tcctgggagg aagaggcagc tacggcggcg 120 
gcggcggtgg cggctagggc ggcggcgaat aaaggggccg ccgccgggtg atgcggtgac 180 
cactgcggca ggcccaggag ctgagtgggc cccggccctc agcccgtccc gncggacccg 24 0 
ctttcctcaa ctctccatct tctcctgccg accgagatcg ccgaggcggn ctcaggctcc 300 
ctancccctt ccccgtccct tccccncccc cgtccccgcc ccgggggccg ccgccacccg 360 
cctcccacca tggctctgaa ganaatccac aaggaattga a 401 

<210> 264 
<211> 401 
<212> DNA 
<213> Homo sapiens 



<400> 264 

aacaccagcc actccaggac ccctgaaggc ctctaccagg tcaccagtgt tctgcgccta 60 
aagccacccc ctggcagaaa cttcagctgt gtgttctgga atactcacgt gagggaactt 120 
actttggcca gcattgacct tcaaagtcag atggaaccca ggacccatcc aacttggctg 180 
cttcacattt tcatcccctc ctgcatcatt gctttcattt tcatagccac agtgatagcc 240 
ctaagaaaac aactctgtca aaagctgtat tcttcaaaag acacaacaaa aagacctgtc 300 
accacaacaa agagggaagt gaacagtgct gtgaatctga acctgtggtc ttgggagcca 360 
gggtgacctg atatgacatc taaagaagct tctggactct g 401 



<210> 265 
<211> 271 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 59 

<223> n = A,T,C or G 



<400> 265 
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gccacttcct gtggacatgg gcagagcgct gctgccagtt cctggtagcc ttgaccacna 60 
cgctgggggg tctttgtgat ggtcatgggt ctcatttgca cttgggggtg tgggattcaa 120 
gttagaagtt tctagatctg gccgggcgca gtggctcaca cctgtaatcc cagcacttta 18 0 
ggaggctgag gcaggcggat catgaggtca ggagatcgag accgtcctgg ctaacacagt 24 0 
gaaaccccgt ctctactaaa aatacaaaaa a 271 

<210> 266 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 45 

<223> n = A,T,C or G 
<400> 266 

attcataaat ttagctgaaa gatactgatt caatttgtat acagngaata taaatgagac 60 
gacagcaaaa ttttcatgaa atgtaaaata tttttatagt ttgttcatac tatatgaggt 120 
tctattttaa atgactttct ggattttaaa aaatttcttt aaatacaatc atttttgtaa 180 
tatttatttt atgcttatga tctagataat tgcagaatat cattttatct gactctgtct 240 
tcataagaga gctgtggccg aattttgaac atctgttata gggagtgatc aaattagaag 300 
gcaatgtgga aaaacaattc tgggaaagat ttctttatat gaagtccctg ccactagcca 360 
gccatcctaa ttgatgaaag ttatctgttc acaggcctgc a 401 

<210> 267 
<211> 401 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 116, 247, 277, 296, 307, 313, 322, 323, 336, 342, 355, 365, 

377, 378, 397 

<223> n = A,T,C or G 

<400> 267 

gaagaggcat cacctgatcc cggagacctt tggagttaag aggcggcgga agcgagggcc 60 

tgtggagtcg gatcctcttc ggggtgagcc agggtcggcg cgcgcggctg tctcanaact 12 0 

catgcagctg ttcccgcgag gcctgtttga ggacgcgctg ccgcccatcg tgctgaggag 18 0 

ccaggtgtac agccttgtgc ctgacaggac cgtggccgac cggcagctga aggagcttca 24 0 

agagcanggg gagacaaaat cgtccagctg ggcttcnact tggatgccca tggaanttat 300 

tctttcnctt ganggactta cnngggaccc aagaanccct tncaaggggc ccttngtgga 360 

tgggncccga aaccccnnta tttgcccttg ggggggncca a 401 

<210> 268 
<211> 223 
<212> DNA 

<213> Homo sapiens 
<400> 268 

tcgccatgtt ggccaggctg gtcttgaact cctgacttta agtgatccac cogcctcaac 60 
ctcccaaagt gctgggatta caggtgtgag ccaccgcgcc tggcctgata catactttta 120 
gaatcaagta gtcacgcact ttttctgttc atttttctaa aaagtaaata tacaaatgtt 180 
ttgttttttg ttttttttgt ttgtttgttt ctgttttttt ttt 223 



<210> 269 
<211> 401 
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<212>, DNA 

<213> Homo sapiens 
<400> 269 

actatgtaaa ccacattgta ctttttttta ctttggcaac aaatatttat acatacaaga 60 
tgctagttca tttgaatatt tctcccaact tatccaagga tctccagctc taacaaaatg 120 
gtttattttt atttaaatgt caatagttgt tttttaaaat ccaaatcaga ggtgcaggcc 18 0 
accagttaaa tgccgtctat caggttttgt gccttaagag actacagagt caaagctcat 240 
ttttaaagga gtaggacaaa gttgtcacag gtttttgttg ttgtttttat tgcccccaaa 300 
attacatgtt aatttccatt tatatcaggg attctattta cttgaagact gtgaagttgc 360 
cattttgtct cattgttttc tttgacataa ctaggatcca t 401 

<210> 270 
<211> 401 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 240, 382 
<223> n = A,T,C or G 

<400> 270 

tggctgttga ttcacctcag cactgcttgg tatctgcacc ctacctctct ttagaggctg 60 
ccttgtcaac tgaaaaatgc acctgacttc gagcaagact ctttccttag gttctggatc 120 
tgtttgagcc ccatggcact gagctggaat ctgagggtct tgttccaagg atgtgatgat 180 
gtgggagaat gttctttgaa agagcagaaa tccagtctgc atggaaacag cctgtagagn 240 
agaagtttcc agtgataagt gttcactgtt ctaaggaggt acaccacagc tacctgaatt 300 
ttcccaaaat gagtgcttct gtgcgttaca actggccttt gtacttgact gtgatgactt 360 
tgttttttct tttcaattct anatgaacat gggaaaaaat g 401 

<210> 271 
<211> 329 
<212> DNA 

<213> Homo sapiens 
<400> 271 

ccacagcctc caagtcaggt ggggtggagt cccagagctg cacagggttt ggcccaagtt 60 
tctaagggag gcacttcctc ccctcgccca tcagtgccag cccctgctgg ctggtgcctg 12 0 
agcccctcag acagccccct gccccgcagg cctgccttct cagggacttc tgcggggcct 180 
gaggcaagcc atggagtgag acccaggagc cggacacttc tcaggaaatg gcttttccca 24 0 
acccccagcc cccacccggt ggttcttcct gttctgtgac tgtgtatagt gccaccacag 300 
cttatggcat ctcattgagg acaaaaaaa 32 9 

<210> 272 
<211> 401 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 1, 7, 12, 21, 61, 62, 66, 72, 78, 88, 90, 92, 98, 117, 119, 
128, 130, 134, 142, 144, 151, 159, 162, 164, 168, 169, 177, 
184, 185, 188, 194, 202, 204, 209, 213, 218, 223, 231, 260, 
272, 299, 300, 306, 321, 322, 323, 331, 335, 336, 338 
<223> n = A,T,C or G 

<221> misc feature 
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<222> 341, 342, 343, 345, 346, 351, 358, 360, 362, 363, 387, 390, 
392 

<223> n = A,T,C or G 



<400> 272 

nggctgntaa cntcggaggt nacttcctgg actatcctgg agaccccctc cgcttccacg 60 
nncatnatat cnctcatngc tgggcccntn angacacnat cccactccaa cacctgngng 120 
atgctggncn cctnggaacc ancntcagaa ngaccctgnt cntntgtnnt ccgcaanctg 18 0 
aagnnaangc gggntacacc tncntgcant ggnccacnct gcngggaact ntacacacct 24 0 
acgggatgtg gctgcgccan gagccaagag cntttctgga tgattcccca gcctcttgnn 300 
agggantcta caacattgct nnntaccttt ntccnncngc nnntnntgga ntacaggngn 360 
tnntaacact acatcttttt tactgcnccn tncttggtgg g 401 

<210> 273 
<211> 401 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 
<222> 399 

<223> n = A,T,C or G 



<400> 273 

cagcaccatg aagatcaaga tcatcgcacc 
tggctccatc ctggcctcac tgtccacctt 
cgacgagtcg ggcccctcca tcgtccaccg 
gtagcatttg ctgcatgggt taattgagaa 
ctcatgctag cctcacgaaa ctggaataag 
tatctgatat cagcactgga ttgtagaact 
aactgttccc cttggtatta acgtgtcagg 

<210> 274 
<211> 401 
<212> DNA 
<213> Homo sapiens 



cccagagcgc aagtactcgg tgtggatcgg 60 

ccagcagatg tggattagca agcaggagta 12 0 

caaatgcttc taaacggact cagcagatgc 180 

tagaaatttg cccctggcaa atgcacacac 240 

ccttcgaaaa gaaattgtcc ttgaagcttg 300 

tgttgctgat tttgaccttg tattgaagtt 360 

gctgagtgnt c 401 



<400> 274 

ccacccacac ccaccgcgcc ctcgttcgcc 
cgccgcccag gccatcgcca ccctccgcag 
cctaccgcag gatgttcggc ggcccgggca 
acgtgactac gtccacccgc acctacagcc 
gcagcctcta cgcctcgtcc ccgggcggcg 
tgcggagcag cgtgcccggg gtgcggctcc 
acgccatcaa caccgagttc aagaacaccc 

<210> 275 
<211> 401 
<212> DNA 
<213> Homo sapiens 



tcttctccgg gagccagtcc gcgccaccgc 60 

ccatgtccac caggtccgtg tcctcgtcct 12 0 

ccgcgagccg gccgagctcc agccggagct 180 

tgggcagcgc gctgcgcccc agcaccagcc 240 

tgtatgccac gcgctcctct gccgtgcgcc 300 

tgcaggactc ggtggacttc tcgctggccg 360 

gcaccaacga g 401 



<400> 275 

ccacttccac cactttgtgg agcagtgcct 
ctggcctggg cctgggcttc gggagagcag 
gaagggactt acctcccaaa ggttctgcag 
agctcctggg tgtgtcagag gccagcctgg 
agggagaggg agaggggacc cgaggctgag 
gacacggcag tgatgctgcg gtctctcctc 



tcagcgcaac ccggatgcca ggtatccctg 60 
agggtgctca ggagggtaag gccagggtgt 12 0 
gggaatctgg agctacacac aggagggatc 18 0 
ggagctctgg ccactgcttc ccatgagctg 240 
gcataagtgg caggatttcg ggaagctggg 300 
ccctttccct ccaggcccag tgccagcacc 360 
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ctcctgaacc actctttctt caagcagatc aagcgacgtg c 4 01 

<210> 276 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 11 

<223> n = A,T,C or G 



<400> 276 

tctgatattg ntacccttga gccacctaag 
attgttgaag aagcacagag ttcagaagac 
tatactttct gtcagccaga aactgtattt 
agtgatgaaa ccagtaatca gcccagtcct 
accgtttctg cttcagaatc tgaagaccgg 
aaggagttga gtaaacgtca gttcagtagt 
gtgattgcaa tcagcatggg atttggccat 



ttagaagaaa ttggaaatca agaagttgtc 60 
tttaacatgg gctcttcctc tagcagccag 12 0 
tcatctcagc ctagtgatga tgaatcaagt 180 
gcctttagac gacgccgtgc taggaagaag 24 0 
ctagttggtg aacaagaaac tgaaccttct 300 
ggtctcaata agtgtgttat acttgctttg 360 
ttctatggca c 4 01 



<210> 277 






<211> 401 






<212> DNA 






<213> Homo 


sapiens 




<220> 






<221> misc 


feature 




<222> 227," 


"333 




<223> n = A,T,C or G 




<400> 277 






aactttggca 


acatatctca 


gcaaaaacta 


tgtgcagagg 


agtggctgca 


atgaggtcac 


gtcctcatca 


cccatccctc 


gaactcaagt 


tccacacatc 


ctgccccatc 


aagatgttct 


gatgcttctt 


gaaaattgct 


tagttgaaaa 


acagtgggaa 


gagaggctgc 


aggaacagcg 


cgggcgcacc 


agtcgtagta 


atccccccaa 


<210> 278 






<211> 401 






<212> DNA 






<213> Homo 


sapiens 




<220> 






<221> misc 


feature 




<222> 322," 


"354 




<223> n = A,T,C or G 





cagctatgtt attcatgcca aaataaaagc 60 
aacggtggtg gatgtaaaag agatcttcaa 120 
cccgctcatt acaaattctt cttgccagtg 180 
catcatgtgt tacgagnggc gctcaaggat 240 
atggagagat cagcttagta aaagatccat 300 
ganaacagtt caggacaaga agaaaacagc 3 60 
accaaaggga a 4 01 



<400> 278 

aatgagtgtg agaccacaaa tgaatgccgg 
ggcttccgtt gttatccacg aaatccttgt 
cgatgtgttt gcccagtctc aaatgccatg 
aaatacatga gcatccgatc tgataggtct 
acaactattt atgccaacac catcaatact 
gagtctacct acgacaacaa anccctgtaa 
caggaccaag agaacatatc gtggacctgg 



gaggatgaaa tgtgttggaa ttatcatggc 60 
caagatccct acattctaac accagagaac 12 0 
tgccgagaac tgccccagtc aatagtctac 18 0 
gtgccatcag acatcttcca gatacaggcc 24 0 
tttcggatta aatctggaaa tgaaaatgga 300 
gtgcaatgct tgtgctcgtg aagncattat 360 
agatgctgac a 4 01 
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<210> 279 
<211> 401 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 30, 35, 81, 88, 180, 212, 378, 384, 391 
<223> n = A,T,C or G 



<400> 279 

aaattattgc ctctgataca tacctaagtn 
cattacttgg agggttgcag nttctaantg 
taggaaacaa gcatgaacgg cagtctagaa 
gccattatcc tgtggaatct gatatgtctg 
tctttggaaa tgatgagatt atttcctgtg 
aatgtgaaac tgaaactaat aattttgatc 
gctctaaata acaaaagnta gggngacaag 

<210> 280 
<211> 326 
<212> DNA 

<213> Homo sapiens 



aacanaacat taatacctaa gtaaacataa 60 
aaactgtatt tgaaactttt aagtatactt 12 0 
taccagaaac atctacttgg gtagcttggn 18 0 
gnagcatgtc attgatggga catgaagaca 240 
ttaaaaaaaa aaaaaatctt aaattcctac 300 
ctgatgtatg ggacagcgta tctgtaccag 360 
nacatgttcc t 401 



<400> 280 

gaagtggaat tgtataattc aattcgataa 
gttttttttg ttgttttttt tttaagaact 
tttttgccca tctgtagtgt atgtgaagat 
tagaattatg agaaaggcac tagatgactt 
atttcttgtg acgccttgtt ggggagggaa 
ctaagattct atatcgcaaa aaaaaa 

<210> 281 
<211> 374 
<212> DNA 
<213> Homo sapiens 



ttgatctcat gggctttccc tggaggaaag 60 
tgaaacttgt aaactgagat gtctgtagct 120 
ttcaaaacct gagagcactt tttctttgtt 18 0 
taggatttgc atttttccct ttattgcctc 240 
atctgtttat tttttcctac aaataaaaag 300 
326 



<400> 281 

caacgcgttt gcaaatattc ccctggtagc 
tcgagcaatg gcttcaggac atgggttctc 
atgaagactg gcttgtctca gtgtttcaac 
cgctccctgt tagtgccgta tgacagcccc 
ctctgtggtc aaggttggtt ggctgattgg 
gtgagcagtc agcaccagtt ctgcaccagc 
tctcctggcc ctgg 

<210> 282 
<211> 404 
<212> DNA 

<213> Homo sapiens 



ctacttcctt acccccgaat attggtaaga 60 
ttctcctgtg atcattcaag tgctcactgc 12 0 
ctcaccaggg ctgtctcttg gtccacacct 180 
catcaaatga ccttggccaa gtcacggttt 240 
tggaaagtag ggtggaccaa aggaggccac 300 
agcgcctccg tcctagtggg tgttcctgtt 360 
374 



<220> 

<221> misc_feature 

<222> 26, 27, 51, 137, 180, 222 

<223> n = A,T,C or G 



<400> 282 
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agtgtggtgg aattcccgca tcctanncgc cgactcacac aaggcagagt ngccatggag 60 
aaaattccag tgtcagcatt cttgctcctt gtggccctct cctacactct ggccagagat 12 0 
accacagtca aacctgnagc caaaaaggac acaaaggact ctcgacccaa actgccccan 18 0 
accctctcca gaggttgggg tgaccaactc atctggactc anacatatga agaagctcta 24 0 
tataaatcca agacaagcaa caaacccttg atgattattc atcacttgga tgagtgccca 300 
cacagtcaag ctttaaagaa agtgtttgct gaaaataaag aaatccagaa attggcagag 360 
cagtttgtcc tcctcaatct ggtttatgaa acaactgaca aaca 404 

<210> 283 
<211> 184 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 26 

<223> n = A,T,C or G 



<400> 283 

agtgtggtgg aattcacttg cttaanttgt 
agcattgtgc aatacagttt cattaactcc 
tttttcaaca ctcttacacc tgttatggaa 
aaaa 

<210> 284 
<211> 421 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 147, 149 
<223> n = A,T,C or G 



gggcaaaaga gaaaaagaag gattgatcag 60 
ttccctcgct cccccaaaaa tttgaatttt 120 
aatgtcaacc tttgtaagaa aaccaaaata 180 
184 



<400> 284 

ctattaatcc tgccacaata tttttaatta cgtacaaaga tctgacatgt cacccaggga 60 
cccatttcac ccactgctct gtttggccgc cagtcttttg tctctctctt cagcaatggt 120 
gaggcggata ccctttcctc ggggaanana aatccatggt ttgttgccct tgccaataac 180 
aaaaatgttg gaaagtcgag tggcaaagct gttgccattg gcatctttca cgtgaaccac 24 0 
gtcaaaagat ccagggtgcc tctctctgtt ggtgatcaca ccaattcttc ctaggttagc 300 
acctccagtc accatacaca ggttaccagt gtcgaacttg atgaaatcag taatcttgcc 360 
agtctctaaa tcaatctgaa tggtatcatt caccttgatg aggggatcgg ggtagcggat 420 
g 421 

<210> 285 
<211> 361 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 34, 188 

<223> n = A,T,C or G 



<400> 285 

ctgggtggta actctttatt tcattgtccg 
cactgtgcag gcttcagctt ccactccggg 
ctgccaggtg cacagccctg gctcccgagg 



gaanaaagat gggagtggga acagggtgga 60 
caggattcag gctatctggg accgcaggga 12 0 
caggcaggca aggtgacggg actggaagcc 18 0 
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cttttcanag ccttggagga gctggtccgt ccacaagcaa tgagtgccac tctgcagttt 24 0 

gcaggggatg gataaacagg gaaacactgt gcattcctca cagccaacag tgtaggtctt 300 

ggtgaagccc cggcgctgag ctaagctcag gctgttccag ggagccacga aactgcaggt 3 60 

a " " ~ 361 



<210> 286 
<211> 336 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 40, 68, 75, 127, 262 

<223> n = A,T,C or G 



<400> 286 

tttgagtggc agcgccttta tttgtggggg ccttcaaggn agggtcgtgg ggggcagcgg 60 

ggaggaanag ccganaaact gtgtgaccgg ggcctcaggt ggtgggcatt gggggctcct 12 0 

cttgcanatg cccattggca tcaccggtgc agccattggt ggcagcgggt accggtcctt 18 0 

tcttgttcaa catagggtag gtggcagcca cgggtccaac tcgcttgagg ctgggccctg 240 

ggcgctccat tttgtgttcc angagcatgt ggttctgtgg cgggagcccc acgcaggccc 300 

tgaggatgtt ctcgatgcag ctgcgctggc ggaaaa 33 6 

<210> 287 
<211> 301 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 15, 33, 44, 53, 76, 83, 107, 117, 154, 166, 192, 194, 207, 

215, 241, 246 

<223> n = A,T,C or G 



<400> 287 

tgggtaccaa atttntttat ttgaaggaat ggnacaaatc aaanaactta agnggatgtt 60 

ttggtacaac ttatanaaaa ggnaaaggaa accccaacat gcatgcnctg ccttggngac 120 

cagggaagtc accccacggc tatggggaaa ttancccgag gcttancttt cattatcact 18 0 

gtctcccagg gngngcttgt caaaaanata ttccnccaag ccaaattcgg gcgctcccat 240 

nttgcncaag ttggtcacgt ggtcacccaa ttctttgatg gctttcacct gctcattcag 300 

g 301 

<210> 288 
<211> 358 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 
<222> 39, 143, 226 
<223> n = A,T,C or G 



<400> 288 

aagtttttaa actttttatt tgcatattaa 
tttgaacaaa aaaaaaaatg gcactctgat 
gggccagctt ggttttactc tanatttcac 
ttcttccttc accaacatgc aagttctttc 
gggaaaggca ggcgcggcct tcgttgtcag 



aaaaattgng cattccaata attaaaatca 60 
taaactgcat tacagcctgc aggacacctt 12 0 
tgtcgtccca ccccacttct tccaccccac 18 0 
cttccctgcc agccanatag atagacagat 24 0 
tagttctttg atgtgaaagg ggcagcacag 300 
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tcatttaaac ttgatccaac ctctttgcat cttacaaagt taaacagcta aaagaagt 358 



<210> 289 
<211> 462 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 87, 141, 182, 220, 269, 327 

<223> n = A,T,C or G 



<400> 289 

ggcatcagaa atgctgttta tttctctgct gctcccaagc tggctggcct ttgcagagga 60 
gcagacaaca gatgcatagt tgggganaaa gggaggacag gttccaggat agagggtgca 12 0 
ggctgaggga ggaagggtaa naggaaggaa ggccatcctg gatccccaca tttcagtctc 180 
anatgaggac aaagggactc ccaagccccc aaatcatcan aaaacaccaa ggagcaggag 240 
gagcttgagc aggccccagg gagcctcana gccataccag ccactgtcta cttcccatcc 300 
tcctctccca ttccctgtct gcttcanacc acctcccagc taagccccag ctccattccc 360 
ccaatcctgg cccttgccag cttgacagtc acagtgcctg gaattccacc actgaggctt 420 
ctcccagttg gattaggacg tcgccctgtt agcatgctgc cc 462 



<210> 290 
<211> 481 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 44, 57, 122, 158, 304, 325, 352, 405 
<223> n = A,T,C or G 



<400> 290 

tactttccta aactttatta aagaaaaaag caataagcaa tggnggtaaa tctctanaac 60 
atacccaatt ttctgggctt cctcccccga gaatgtgaca ttttgatttc caaacatgcc 120 
anaagtgtat ggttcccaac tgtactaaag taggtganaa gctgaagtcc tcaagtgttc 180 
atcttccaac ttttcccagt ctgtggtctg tctttggatc agcaataatt gcctgaacag 24 0 
ctactatggc ttcgttgatt tttgtctgta gctctctgag ctcctctatg tgcagcaatc 300 
gcanaatttg agcagcttca ttaanaactg catctcctgt gtcaaaacca anaatatgtt 360 
tgtctaaagc aacaggtaag ccctcttttg tttgatttgc cttancaact gcatcctgtg 420 
tcaggcgctc ctgaaccaaa atccgaattg ccttaagcat taccaggtaa tcatcatgac 480 
g 481 

<210> 291 
<211> 381 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 79, 166, 187, 208, 219, 315 

<223> n = A,T,C or G 



<400> 291 

tcatagtaat gtaaaaccat ttgtttaatt 
attagtgact ggttaaggng tgccactgta 
cctggtccta gtccacaagg gtggcaggag 
acaaaanaaa ggaaagctgc cttggcanaa 



ctaaatcaaa tcactttcac aacagtgaaa 60 
catatcatca ttttctgact ggggtcagga 120 
gagggtggag gctaanaaca cagaaaacac 18 0 
ggatgaggng gtgagcttgc cgaaggatgg 240 
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tgggaagggg gctccctgtt ggggccgagc caggagtccc aagtcagctc tcctgcctta 300 
cttagctcct ggcanagggt gagtggggac ctacgaggtt caaaatcaaa tggcatttgg 360 
ccagcctggc tttactaaca g 381 

<210> 292 
<211> 371 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 32, 55, 72, 151, 189, 292 

<223> n = A,T,C or G 

<400> 292 

gaaaaaataa tccgtttaat tgaaaaacct gnaggatact attccactcc cccanatgag 60 
gaggctgagg anaccaaacc cctacatcac ctcgtagcca cttctgatac tcttcacgag 12 0 
gcagcaggca aagacaattc ccaaaacctc nacaaaagca attccaaggg ctgctgcagc 180 
taccaccanc acatttttcc tcagccagcc cccaatcttc tccacacagc cctccttatg 240 
gatcgccttc tcgttgaaat taatcccaca gcccacagta acattaatgc ancaggagtc 300 
ggggactcgg ttcttcgaca tggaagggat tttctcccaa tctgtgtagt tagcagcccc 360 
acagcactta a 371 

<210> 293 
<211> 361 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 75, 196, 222 
<223> n = A,T,C or G 

<400> 293 

gatttaaaag aaaacacttt attgttcagc aattaaaagt tagccaaata tgtatttttc 60 

tccataattt attgngatgt tatcaacatc aagtaaaatg ctcattttca tcatttgctt 120 

ctgttcatgt tttcttgaac acgtcttcaa ttttccttcc aaaatgctgc atgccacact 180 

tgaggtaacg aagcanaagt atttttaaac atgacagcta anaacattca tctacagcaa 240 

cctatatgct caatacatgc cgcgtgatcc tagtagtttt ttcacaacct tctacaagtt 300 

tttggaaaac atctgttatg atgactttca tacaccttca cctcaaaggc tttcttgcac 360 

c 361 

<210> 294 
<211> 391 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 26, 77, 96, 150, 203, 252, 254, 264, 276 
<223> n = A,T,C or G 

<400> 294 

tattttaaag tttaattatg attcanaaaa aatcgagcga ataactttct ctgaaaaaat 60 

atattgactc tgtatanacc acagttattg gggganaagg gctggtaggt taaattatcc 12 0 

tattttttat tctgaaaatg atattaatan aaagtcccgt ttccagtctg attataaaga 18 0 

tacatatgcc caaaatggct ganaataaat acaacaggaa atgcaaaagc tgtaaagcta 24 0 

agggcatgca ananaaaatc tcanaatacc caaagnggca acaaggaacg tttggctgga 300 
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atttgaagtt atttcagtca tctttgtctt tggctccatg tttcaggatg cgtgtgaact 360 
cgatgtaatt gaaattcccc tttttatcaa t 391 

<210> 295 
<211> 343 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
. <222> 145, 174, 205, 232 
<223> n = A,T,C or G 

<400> 295 

ttcttttgtt ttattgataa cagaaactgt gcataattac agatttgatg aggaatctgc 60 

aaataataaa gaatgtgtct actgccagca aaatacaatt attccatgcc ctctcaacat 12 0 

acaaatatag agttcttcac accanatggc tctggtgtaa caaagccatt ttanatgttt 18 0 

aattgtgctt ctacaaaacc ttcanagcat gaggtiagttt cttttaccta cnatattttc 240 

cacatttcca ttattacact tttagtgagc taaaatcctt ttaacatagc ctgcggatga 300 

tctttcacaa aagccaagcc tcatttacaa agggtttatt tct 343 

<210> 296 
<211> 241 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 96, 98, 106, 185 
<223> n = A,T,C or G 

<400> 296 

ttcttggata ttggttgttt ttgtgaaaaa gtttttgttt ttcttctcag tcaactgaat 60 
tatttctcta ctttgccctc ctgatgccca catgananaa cttaanataa tttctaacag 120 
cttccacttt ggaaaaaaaa aaaacctgtt ttcctcatgg aaccccagga gttgaaagtg 180 
gatanatcgc tctcaaaatc taaggctctg ttcagcttta cattatgtta cctgacgttt 240 
t 241 

<210> 297 
<211> 391 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 12, 130 

<223> n = A,T,C or G 

<400> 297 

gttgtggctg anaatgctgg agatgctcag ttctctccct cacaaggtag gccacaaatt 60 
cttggtggtg ccctcacatc tggggtcttc aggcaccagc catgcctgcc gaggagtgct 12 0 
gtcaggacan accatgtccg tgctaggccc aggcacagcc caaccactcc tcatccaagt 18 0 
ctctcccagg tttctggtcc cgatgggcaa ggatgacccc tccagtggct ggtaccccac 240 
catcccacta cccctcacat gctctcactc tccatcaggt ccccaatcct ggcttccctc 300 
ttcacgaact ctcaaagaaa aggaaggata aaacctaaat aaaccagaca gaagcagctc 360 
tggaaaagta caaaaagaca gccagaggtg t 391 



<210> 298 
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<211> 321 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 14, 30, 76, 116, 201, 288, 301 
<223> n = A,T,C or G 



<400> 298 

caagccaaac tgtntccagc tttattaaan atactttcca taaacaatca tggtatttca 60 
ggcaggacat gggcanacaa tcgttaacag tatacaacaa ctttcaaact cccttnttca 12 0 
atggactacc aaaaatcaaa aagccactat aaaacccaat gaagtcttca tctgatgctc 180 
tgaacaggga aagtttaaag ngagggttga catttcacat ttagcatgtt gtttaacaac 240 
ttttcacaag ccgaccctga ctttcaggaa gtgaaatgaa aatggcanaa tttatctgaa 300 
natccacaat ctaaaaatgg a 321 



<210> 299 
<211> 401 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 104, 268, 347 
<223> n = A,T,C or G 



<400> 299 






tatcataaag 


agtgttgaag 


tttatttatt 


attggtaaaa 


aaataaaaca 


aaaagcattt 


agaagtatca 


tttttctttg 


tcaaattata 


tggaattttg 


tcggtcactt 


gcactggttg 


agttaaattt 


tttttgttgg 


gatttcanat 


caacgtccac 


accaaattct 


tgatcaggac 


taggtagtct 


cacagccttg 


cgtgttcgat 


<210> 300 






<211> 188 






<212> DNA 






<213> Homo 


sapiens 




<220> 






<221> misc 


feature 




<222> 48 






<223> n = A,T,C or G 





atagcaccat tgagacattt tgaaattgga 60 

gaattgtatt tggnggaaca gcaaaaaaag 12 0 

ctgtttccaa acattttgga aataaataac 180 

acaagattag aacaagagga acacatatgg 240 

agagtttggt ttataaaaag caaacagggc 300 

caccaatgtc atagggngca atatctacaa 360 

attcaaagac t 4 01 



<400> 300 

tgaatgcttt gtcatattaa 
ggtgtatctt gtttctaata 
tgtatgtcag tgtataaaac 
gaaaaaaa 



gaaagttaaa gtgcaataat 
agataaactt ttttgtcttt 
atactgtgtg gtataacagg 



gtttgaanac aataagtggt 60 
gctttatctt attagggagt 12 0 
cttaataaat tctttaaaag 180 
188 



<210> 301 
<211> 291 
<212> DNA 

<213> Homo sapiens 



<400> 301 
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aagattttgt tttattttat tatggctaga aagacactgt tatagccaaa atcggcaatg 60 
acactaaaga aatcctctgt gcttttcaat atgcaaatat atttcttcca agagttgccc 120 
tggtgtgact tcaagagttc atgttaactt cttttctgga aacttccttt tcttagttgt 18 0 
tgtattcttg aagagcctgg gccatgaaga gcttgcctaa gttttgggca gtgaactcct 240 
tgatgttctg gcagtaagtg tttatctggc ctgcaatgag cagcgagtcc a 2 91 

<210> 302 
<211> 341 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 
<222> 25 

<223> n = A,T,C or G 



<400> 302 

tgatttttca taattttatt aaatnatcac 
attacactac aatctgatag gagtggtaaa 
aaacgccacc ttttattgtc ctgtcttatt 
ttcatgagcc agcagtggac ttgagttaca 
gaagaagcca tcaaattctt gaggacttga 
cccccgggct gcaggaattc gatatcaagc 

<210> 303 
<211> 361 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 15, 27, 92, 124, 127, 183, 

<223> n = A,T,C or G 



tgggaaaact aatggttcgc gtatcacaca 60 
accagccaat ggaatccagg taaagtacaa 120 
tctcgggaag gagggttcta ctttacacat 180 
atgtgtaggt tccttgtggt tatagctgca 240 
catctctcgg aaagaagcaa actagtggat 300 
ttatcgatac c 341 



<400> 303 

tgcagacagt aaatnaattt 
gctccgtgac agcccaccaa 
caanaanatg gaaggatctc 
tanacagacg gagttganat 
ccanacttca tcccagccgg 
actttgccgc agttccaggn 



tatttgngtt cacagaacat 
cccccaaccc tntacctcgc 
acggatctca ttcctaatgg 
gctggaggat gcagtcacct 
gacgtcctcc cccacccgag 
gtcctgcttc caccagtccc 



actaggcgat ctcgacagtc 60 
agccacccta aaggcgactt 120 
tccgccgaag tctcacacag 180 
cctaaactta cgacccacca 240 
tcctccccat ttcttctcct 300 
acaaagctca ataaatacca 360 
361 



<210> 304 
<211> 301 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 23, 104, 192 
<223> n = A,T,C or G 



<400> 304 

ctctttacaa cagcctttat ttncggccct 
tagctccgcc cgccaggctc tgtgccgcct 
ctcaggggct tgaggccgta ctcccccagc 
aaggtcagcc anaacaggtc gtcctgcaca 



tgatcctgct cggatgctgg tggaggccct 60 
ccccgcaggc gcanattcat gaacacggtg 120 
gggagctggt cctccagggg cttcccctcg 18 0 
ccctccagcc cgctcacttg ctgcttcagg 240 
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tgggccacgg tctgcgtcag ccgcacctcg taggtgctgc tgcggccctt gttattcctc 300 
a ' 301 

<210> 305 
<211> 331 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 3, 36, 60, 193, 223 

<223> n = A,T,C or G 

<400> 305 

ganaggctag taacatcagt tttattgggt tggggnggca accatagcct ggctgggggn 60 

ggggctggcc ctcacaggtt gttgagttcc agcagggtct ggtccaaggt ctggtgaatc 120 

tcgacgttct cctccttggc actggccaag gtctcttcta ggtcatcgat ggttttctcc 180 

aactttgcca canacctctc ggcaaactct gctcgggtct cancctcctt cagcttctcc 24 0 

tccaacagtt tgatctcctc ttcatattta tcttctttgg gggaatactc ctcctctgag 300 

gccatcaggg acttgagggc ctggtccatg g 331 

<210> 306 
<211> 457 
<212> DNA 
<213> Homo sapiens 

<400> 306 

aatatgtaaa ggtaataact tttattatat taaagacaat gcaaacgaaa aacagaattg 60 

agcagtgcaa aatttaaagg actgttttgt tctcaaagtt gcaagtttca aagccaaaag 120 

aattatatgt atcaaatata taagtaaaaa aaagttagac tttcaagcct gtaatcccag 180 

cactttggga ggctgaggca ggtggatcac taacattaaa aagacaacat tagattttgt 240 

cgatttatag caattttata aatatataac tttgtcactt ggatcctgaa gcaaaataat 300 

aaagtgaatt tgggattttt gtacttggta aaaagtttaa caccctaaat tcacaactag 360 

tggatccccc gggctgcagg aattcgatat caagcttatc gataccgtcg acctcgaggg 420 

ggggcccggt acccaattcg ccctatagtg agtcgta 457 

<210> 307 
<211> 491 
<212> DNA 

<213> Homo sapiens 
<400> 307 

gtgcttggac ggaacccggc gctcgttccc caccccggcc ggccgcccat agccagccct 60 

ccgtcacctc ttcaccgcac cctcggactg ccccaaggcc cccgccgccg ctccagcgcc 12 0 

gcgcagccac cgccgccgcc gccgcctctc cttagtcgcc gccatgacga ccgcgtccac 180 

ctcgcaggtg cgccagaact accaccagga ctcagaggcc gccatcaacc gccagatcaa 240 

cctggagctc tacgcctcct acgtttacct gtccatgtct tactactttg accgcgatga 300 

tgtggctttg aagaactttg ccaaatactt tcttcaccaa tctcatgagg agagggaaca 3 60 

tgctgagaaa ctgatgaagc tgcagaacca acgaggtggc cgaatcttcc ttcaggatat 420 

caagaaacca gactgtgatg actgggagag cgggctgaat gcaatggagt gtgcattaca 480 

tttggaaaaa a 491 

<210> 308 

<211> 421 

<212> DNA 

<213> Homo sapiens 



<400> 308 
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ctcagcgctt cttctttctt ggtttgatcc 
aggccctgga tgtgatggtg tccaccttcc 
tcaagctcaa caagtcagaa ctaaaggagc 
ggaaaaggac agatgaagct gctttccaga 
acaacgaggt ggacttccaa gagtactgtg 
acgaattctt tgaaggcttc ccagataagc 
gtggttgggg ggtctgccag ctggggccct 



tgactgctgt catggcgtgc cctctggaga 60 
acaagtactc gggcaaagag ggtgacaagt 12 0 
tgctgacccg ggagctgccc agcttcttgg 18 0 
agctgatgag caacttggac agcaacaggg 24 0 
tcttcctgtc ctgcatcgcc atgatgtgta 300 
agcccaggaa gaaatgaaaa ctcctctgat 360 
ccctgtcgcc agtgggcact tttttttttc 42 0 
421 



<210> 309 
<211>,321 
<212> DNA 
<213> Homo sapiens 



<400> 309 

accaaatggc ggatgacgcc ggtgcagcgg gggggcccgg gggccctggt ggccctggga 60 

tggggaaccg cggtggcttc cgcggaggtt tcggcagtgg catccggggc cggggtcgcg 120 

gccgtggacg gggccggggc cgaggccgcg gagctcgcgg aggcaaggcc gaggataagg 180 

agtggatgcc cgtcaccaag ttgggccgct tggtcaagga catgaagatc aagtccctgg 240 

aggagatcta tctcttctcc ctgcccatta aggaatcaga gatcattgat ttcttcctgg 300 

gggcctctct caaggatgag g 321 

<210> 310 
<211> 381 
<212> DNA 
<213> Homo sapiens 

<400> 310 

ttaaccagcc atattggctc aataaatagc ttcggtaagg agttaatttc cttctagaaa 60 

tcagtgccta tttttcctgg aaactcaatt ttaaatagtc caattccatc tgaagccaag 120 

ctgttgtcat tttcattcgg tgacattctc tcccatgaca cccagaaggg gcagaagaac 180 

cacatttttc atttatagat gtttgcatcc tttgtattaa aattattttg aaggggttgc 240 

ctcattggat ggcttttttt tttttcctcc agggagaagg ggagaaatgt acttggaaat 300 

taatgtatgt ttacatctct ttgcaaattc ctgtacatag agatatattt tttaagtgtg 360 

aatgtaacaa catactgtga a 381 



<210> 311 
<211> 538 
<212> DNA 
<213> Homo sapiens 



<400> 311 

tttgaattta caccaagaac ttctcaataa 
cataccacaa gagaagttaa tttcttaaca 
accaagttct gatatctttt aaagacatag 
tgaaaatatc cttgttgtgt attaggtttt 
gtcatcagta ccctcctatt cagctcccca 
ggttttcttc ttatttttag ataattcaag 
tttatggtaa actcttttaa agaaaattta 
ttaaatcttt atcatagact ctgtacatat 
atcatcggtg ggatgaoaga acaaacatat 



aagaaaatca tgaatgctco acaatttcaa 60 

ttgtgttcta tgattatttg taagaccttc 120 

ttcaaaattg cttttgaaaa tctgtattct 180 

taaataccag ctaaaggatt acctcactga 240 

agatgatgtg tttttgctta ccctaagaga 300 

tgcttagata aattatgttt tctttaagtg 360 

atatgttata gctgaatctt tttggtaact 42 0 

gttcaaatta gctgcttgcc tgatgtgtgt 480 

ttatgatcat gaataatgtg ctttgtaa 53 8 



<210> 312 
<211> 176 
<212> DNA 
<213> Homo sapiens 



<400> 312 
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ggaggagcag ctgagagata gggtcagtga atgcggttca gcctgctacc tctcctgtct 60 
tcatagaacc attgccttag aattattgta tgacacgttt tttgttggtt aagctgtaag 12 0 
gttttgttct ttgtgaacat gggtattttg aggggagggt ggagggagta gggaag 17 6 



<210> 313 
<211> 396 
<212> DNA 
<213> Homo sapiens 



<400> 313 

ccagcacccc caggccctgg gggacctggg 
tggcgctccc atggctcttg caacatctcc 
agccaccagc ccctcactgg gttcggagga 
gaaacatcgg atttggggaa cgcgtgtcaa 
actgttctgt tccttgtgta actgtgttgc 
gtcaccgggg caactgcctg ggggcgggga 
tataccaaag gtgctacatc tatgtgatgg 

<210> 314 
<211> 311 
<212> DNA 
<213> Homo sapiens 



ttctcagact gccaaagaag ccttgccatc 60 
ccttcgtttt tgagggggtc atgccggggg 12 0 
gagtcaggaa gggccaagca cgacaaagca 18 0 
tcccttgtgc cgcagggctg ggcgggagag 24 0 
tgaaagacta cctcgttctt gtcttgatgt 300 
tgggggcagg gtggaagcgg ctccccattt 360 
gtgggg 39 6 



<400> 314 

cctcaacatc ctcagagagg actggaagcc 
cctgcagtat ctcttcttgg agcccaaccc 
ggtcctgcag aacaaccggc ggctgtttga 
ctacatcggc tccacctact ttgagcgctg 
cgccacggcc acaagccctg gcatcccctg 
tttggggggc g 

<210> 315 
<211> 336 
<212> DNA 
<213> Homo sapiens 



agtccttacg ataaactcca taatttatgg 60 
cgaggaccca ctgaacaagg aggccgcaga 12 0 
gcagaacgtg cagcgctcca tgcggggtgg 18 0 
cctgaaatag ggttggcgca tacccacccc 24 0 
caaatattta ttgggggcca tgggtagggg 300 
311 



<400> 315 

tttagaacat ggttatcatc caagactact 
aatccacatt cctcttgagt tctgcagctt 
cgtagaatca catgatctga ggaccattca 
gtcttccata aagttttgca tggagcaaac 
agccctctaa aagcataggg cttagcctgc 
gttttgtaaa cactatagca tctgttaaga 

<210> 316 
<211> 436 
<212> DNA 
<213> Homo sapiens 



ctaccctgca acattgaact cccaagagca 60 
ctgtgtaaat agggcagctg tcgtctatgc 12 0 
tggaagctgc taaatagcct agtctgggga 18 0 
aaacaggatt aaactaggtt tggttccttc 24 0 
aggcttcctt gggctttctc tgtgtgtgta 300 
tccagt 336 



<400> 316 

aacatggtct gcgtgcctta agagagacgc 
atgtttccat tggaattgtt ggtaaagact 
tgtctccatt cctggaaggt cttgaagaaa 
ctgctgatga acctgcagaa aaggctgatg 
ctatatatgt attatcaaat atgtaagaat 
atactttgaa ccaaaagttg cagagtggtg 
gtgagttttt tccaagcaac ctcactgaaa 
agggtctgta taatca 



ttcctgcaga acaggacctg actacaaaga 60 

tggagtttac aatctatgat gatgatgatg 120 

gaccacagag aaaggcacag cctgctcaac 180 

aaccaatgga acattaagtg ataagccagt 240 

acaggcacca catactgatg acaataatct 300 

gaatgctatg ttttaggaat cagtccagat 360 

cctatataat ggaatacatt tttctttgaa 42 0 
436 
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<210> 317 
<211> 196 
<212> DNA 
<213> Homo sapiens 



<400> 317 

tattccttgt gaagatgata tactattttt 
gctgctggct tgcagtgcgc gtgcacgtgg 
atgctccctc ccctgccctg gtccagggaa 
atctgcccct ccccca 



gttaagcgtg tctgtattta tgtgtgagga 60 
agagctggtg cccggagatt ggacggcctg 12 0 
gctggccgag ggtcctggct cctgaggggc 180 
196 



<210> 318 
<211> 381 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 8, 9, 102, 122, 167, 182, 1'93, 235, 253, 265, 266, 290, 321, 
378 

<223> n = A,T,C or G 



<400> 318 

gacgcttnng ccgtaacgat gatcggagac 
gccggggcgg tgctgaactt taagctgaaa 
tncagggagc ccaacacagg tgacaacatc 
cnaatcttca tcnccctgtg gaacatcttc 
tcttgaatcc cancgatgaa accannaact 
tccattcctg atgacttcaa naatgttttt 
tccaagctcg tggtgggngg a 



atcctgctgt tcgggacgtt gctgatgaat 60 
aagaaggaca cncagggctt tggggaggag 12 0 
cgggaattct tgctgancct cagatacttt 18 0 
atgatgttct gcatgattgt gctgntcggc 240 
cactttcccg ggatgccgan tctccattcc 300 
gaccaaaaaa ccgacaacct tcccagaaag 360' 
381 



<210> 319 
<211> 506 
<212> DNA 
<213> Homo sapiens 



<400> 319 

ctaagcttta cgaatggggt gacaacttat 
tttgtaaata cctttgttat aattgatagg 
cctctgagca gtgtatgtca ggacttgttc 
ttatacaggt agagatgtat gcagatgtgt 
ccattgatgt atgcatctct tggctgtact 
ctttgctaat attttaatgg tatagatctg 
tctgttgctg tgtgtttcat tttaaattga 
actctgccaa tgcttttatc tagaggcgtg 
tcccaagaaa ggcaggatta catctt 



gataaaaact agagctagtg aattagccta 60 
atacatcttg gacatggaat tgttaagcca 12 0 
attaggttgg cagcagaggg gcagaaggaa 180 
ccatatatgt ccatatttac attttgatag 240 
ataagaacac attaattcaa tggaaataca 300 
ctaatgaatt ctcttaaaaa catactgtat 360 
gcattaaggg aatgcagcat ttaaatcaga 42 0 
ttgccatttt tgtcttatat gaaatttctg 480 
506 



<210> 320 
<211> 351 
<212> DNA 
<213> Homo sapiens 



<400> 320 

ctgacctgca ggacgaaacc atgaagagcc 
cggtagtaac tttgtgttat gaatcacatg 
tcattaacag gagaaatgca aataccttca 
tccaagagag gatccgagaa cgctctaagc 



tgatccttct tgccatcctg gccgccttag 60 
aaagcatgga atcttatgaa cttaatccct 120 
tatcccctca gcagagatgg agagctaaag 18 0 
ctgtccacga gctcaatagg gaagcctgtg 24 0 
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atgactacag actttgcgaa cgctacgcca tggtttatgg atacaatgct gcctataatc 300 
gctacttcag gaagcgccga gggaccaaat gagactgagg gaagaaaaaa a 351 

<210> 321 
<211> 421 
<212> DNA 

<213> Homo sapiens 



<400> 321 

ctcggaggcg ttcagctgct tcaagatgaa 
ccagaaactc attgaagtgg acgatgaacg 
ggccacagaa gttgctgctg acgctctggg 
cagtggtggg aacgacaaac aaggtttccc 
tgtccgcctg ctactgagta aggggcattc 
aaagagaaaa tcagttcgtg gttgcattgt 
tattgtaaaa aaaggagaga aggatattcc 



gctgaacatc tccttcccag ccactggctg 60 
caaacttcgt actttctatg agaagcgtat 12 0 
tgaagaatgg aagggttatg tggtccgaat 18 0 
catgaagcag ggtgtcttga cccatggccg 240 
ctgttacaga ccaaggagaa ctggagaaag 300 
ggatgcaaat ctgagcgttc tcaacttggt 3 60 
tggactgact gatactacag tgcctcgccg 42 0 
421 



<210> 322 
<211> 521 
<212> DNA 
<213> Homo sapiens 



<400> 322 

agcagctctc ctgccacagc tcctcacccc 
tccactccct ccttggtcaa gagcacctca 
gtgctgaaac gaccggagat actgacagat 
ccccttacct cacttgtctc tagccgcagc 
gacacagcag ccaagttcat tggagctggg 
gctgggattg gaactgtgtt tgggagcctc 
aagcaacagc tcttctccta cgccattctg 
ttttgtctga tggtagcctt tctcatcctc 
ccatagttct cccgcgtctg gttggccccg 

<210> 323 
<211> 435 
<212> DNA 
<213> Homo sapiens 



ctgaaaatgt tcgcctgctc caagtttgtc 60 
cagctgctga gccgtccgct atctgcagtg 120 
gagagcctca gcagcttggc agtctcatgt 180 
ttccaaacca gcgccatttc aagggacatc 240 
gctgccacag ttggggtggc tggttctggg 300 
atcattggtt atgccaggaa cccttctctg 3 60 
ggctttgccc tctcggaggc catggggctc 420 
tttgccatgt gaaggagccg tctccacctc 480 
tgtgttcctt t 521 



<400> 323' 

ccgaggtcgc acgcgtgaga cttctccgcc 
tcctacctgc tggctgccct agggggcaac 
atcttggaca gcgtgggtat cgaggcggac 
ctgaatggaa aaaacattga agacgtcatt 
cctgctggtg gggctgtagc cgtctctgct 
tctgcccctg ctgcagcaga ggagaagaaa 
gatgatgaca tgggatttgg cctttttgat 
ttttacacat ctcaa 



gcagacgccg ccgcgatgcg ctacgtcgcc 60 
tcctccccca gcgccaagga catcaagaag 12 0 
gacgaccggc tcaacaaggt tatcagtgag 180 
gcccagggta ttggcaagct tgccagtgta 240 
gccccaggct ctgcagcccc tgctgctggt 300 
gatgagaaga aggaggagtc tgaagagtca 360 
taaattcctg ctcccctgca aataaagcct 42 0 
435 



<210> 324 
<211> 521 
<212> DNA 
<213> Homo sapiei 



<400> 324 

aggagatcga ctttcggtgc ccgcaagacc 
tggtgcagta caagaatcgt caggccatcc 
agcacctggt ccagcagcag cccccctcgc 



agggctggaa cgccgagatc acgctgcaga 60 
tggcggtcaa atccacgcgg cagaagcagc 12 0 
agccgcagcc gcagccgcag ctccagcccc 18 0 
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aacccaagcc 


tcagccccag 


cagctccacc 


ctcatcctca 


ctcgcaccca 


caccctcacc 


cacacccaca 


gccgcactcg 


cagccgcacg 


ctgcctgaaa 


ggggcagctc 


ccgggcaaga 


gagcacattt 


ctattgtctt 


cacttggatc 


<210> 325 






<211> 451 






<212> DNA 






<213> Homo 


sapiens 





agccccaatc acaaccccag cctcagcccc 240 

cgtatccgca tccacatcca catccacact 300 

cgcacccgca tccgcaccaa ataccgcacc 3 60 

ggcaccggct tctccgcagc acctccaact 420 

caaggttttg aggacttgag gaagtgggac 48 0 

aaaagcaaaa c 521 



<400> 325 

attttcattt ccattaacct ggaagctttc atgaatattc tcttctttta aaacatttta 60 
acattattta aacagaaaaa gatgggctct ttctggttag ttgttacatg atagcagaga 12 0 
tatttttact tagattactt tgggaatgag agattgttgt cttgaactct ggcactgtac 180 
agtgaatgtg tctgtagttg tgttagtttg cattaagcat gtataacatt caagtatgtc 24 0 
atccaaataa gaggcatata cattgaattg tttttaatcc tctgacaagt tgactcttcg 300 
acccccaccc ccacccaaga cattttaata gtaaatagag agagagagaa gagttaatga 360 
acatgaggta gtgttccact ggcaggatga cttttcaata gctcaaatca atttcagtgc 420 
ctttatcact tgaattatta acttaatttg a 451 

<210> 326 
<211> 421 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 296 

<223> n = A,T,C or G 



<400> 326 

cgcggtcgta agggctgagg atttttggtc 
tcgctctcgc cgaggaacaa gtcggtcagg 
ggataccgga aaaacacccg tggagccgga 
aacaagccgc aacgtaaaat ccttggaaaa 
agaaaagaat ctcaaagtga aaggaccagt 
tacaagaaaa actccttgtg gtgaaggttc 
tcacaagcga ctcattgact tgcacagtcc 



cgcacgctcc tgctcctgac tcaccgctgt 60 

aagcccgcgc gcaacagcca tggcttttaa 120 

ggtggcaatt caccgaattc gaatcaccct 180 

ggtgtgtgct gacttgataa gaggcgcaaa 240 

tcgaatgcct accaagactt tgagantcac 300 

taagacgtgg gatcgtttcc agatgagaat 360 

ttctgagatt gttaagcaga ttacttccat 42 0 
421 



<210> 327 
<211> 456 
<212> DNA 

<213> Homo sapiens 
<400> 327 

atcttgacga ggctgcggtg tctgctgcta ttctccgagc ttcgcaatgc cgcctaagga 60 
cgacaagaag aagaaggacg ctggaaagtc ggccaagaaa gacaaagacc cagtgaacaa 120 
atccgggggc aaggccaaaa agaagaagtg gtccaaaggc aaagttcggg acaagctcaa 18 0 
taacttagtc ttgtttgaca aagctaccta tgataaactc tgtaaggaag ttcccaacta 240 
taaacttata accccagctg tggtctctga gagactgaag attcgaggct ccctggccag 300 
ggcagccctt caggagctcc ttagtaaagg acttatcaaa ctggtttcaa agcacagagc 360 
tcaagtaatt tacaccagaa ataccaaggg tggagatgct ccagctgctg gtgaagatgc 42 0 
atgaataggt ccaaccagct gtacatttgg aaaaat 45 6 



<210> 328 
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<211> 471 
<212> DNA 

<213> Homo sapiens 



aaaccctgcg tggcaatccc tgacgcaccg ccgtgatgcc 60 

ggaagtccaa ctacttcctt aagatcatcc aactattgga 120 

ttgtgggagc agacaatgtg ggctccaagc agatgcagca 180 

ggaaggctgt ggtgctgatg ggcaagaaca ccatgatgcg 24 0 

tggaaaacaa cccagctctg gagaaactgc tgcctcatat 300 

tgttcaccaa ggaggacctc actgagatca gggacatgtt 360 

ctgctgcccg tgctggtgcc attgccccat gtgaagtcac 420 

gtctcgggcc cgagaagacc tcctttttcc a 471 

<210> 329 
<211> 278 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 154, 204 
<223> n = A,T,C or G 



<400> 328 

gtggaagtga catcgtcttt 
cagggaagac agggcgacct 
tgattatccg aaatgtttca 
gatccgcatg tcccttcgcg 
caaggccatc cgagggcacc 
ccgggggaat gtgggctttg 
gctggccaat aaggtgccag 
tgtgccagcc cagaacactg 



<400> 329 

gtttaaactt aagcttggta ccgagctcgg 

aaattgagat gcccccccag gccagcaaat 

ccttgatatt tttctttttt tttttttttt 

aggtgctatt taacatggga gganagcgtg 

tccaccctct ctccacctgc ctctggcttc 



atccactagt ccagtgtggt ggaattctag 60 
gttccttttt gttcaaagtc tatttttatt 120 
ttgnggatgg ggacttgtga atttttctaa 180 
tgcggctcca gcccagcccg ctgctcactt 240 
tcaggcct 27 8 



<210> 330 
<211> 338 
<212> DNA 

<213> Homo sapiens 



<400> 330 

ctcaggcttc aacatcgaat acgccgcagg 
cacaaacatt attataataa acaccctcac 
cgcactctcc cctgaactct acacaacata 
cctgttctta tgaattcgaa cagcataccc 
cctatgaaaa aacttcctac cactcaccct 
cattacaatc tccagcattc cccctcaaac 

<210> 331 
<211> 2820 
<212> DNA 

<213> Homo sapiens 



ccccttcgcc ctattcttca tagccgaata 60 
cactacaatc ttcctaggaa caacatatga 12 0 
ttttgtcacc aagaccctac ttctaacctc 18 0 
ccgattccgc tacgaccaac tcatacacct 24 0 
agcattactt atatgatatg tctccatacc 300 
ctaaaaaa 338 



<400> 331 

tggcaaaatc ctggagccag aagaaaggac 
gttgtacctg gaaaacaatg cccagactca 
gctcctgaac agcatggacc agcagattcg 
cacagaccac gcgcagaaca gcgtcacggc 
cttcgatgct ctctctccat cacccgccat 
cagttccgac gtgtccttcc agcagtcgag 
cactgaactg aagaaactct actgccaaat 
gatgacccca cctcctcagg gagctgttat 



agcagcattg atcaatctta cagctaacat 60 
atttagtgag ccacagtaca cgaacctggg 120 
gaacggctcc tcgtccacca gtccctataa 18 0 
gccctcgccc tacgcacagc ccagccccac 24 0 
cccctccaac accgactacc caggcccgca 300 
caccgccaag tcggccacct ggacgtattc 360 
tgcaaagaca tgccccatcc agatcaaggt 42 0 
ccgcgccatg cctgtctaca aaaaagctga 48 0 
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gcacgtcacg gaggtggtga agcggtgccc caaccatgag ctgagccgtg agttcaacga 54 0 
gggacagatt gcccctccta gtcatttgat tcgagtagag gggaacagcc atgcccagta 600 
tgtagaagat cccatcacag gaagacagag tgtgctggta ccttatgagc caccccaggt 660 
tggcactgaa ttcacgacag tcttgtacaa tttcatgtgt aacagcagtt gtgttggagg 72 0 
gatgaaccgc cgtccaattt taatcattgt tactctggaa accagagatg ggcaagtcct 780 
gggccgacgc tgctttgagg cccggatctg tgcttgccca ggaagagaca ggaaggcgga 84 0 
tgaagatagc atcagaaagc agcaagtttc ggacagtaca aagaacggtg atggtacgaa 90 0 
gcgcccgttt cgtcagaaca cacatggtat ccagatgaca tccatcaaga aacgaagatc 960 
cccagatgat gaactgttat acttaccagt gaggggccgt gagacttatg aaatgctgtt 1020 
gaagatcaaa gagtccctgg aactcatgca gtaccttcct cagcacacaa ttgaaacgta 1080 
caggcaacag caacagcagc agcaccagca cttacttcag aaacagacct caatacagtc 1140 
tccatcttca tatggtaaca gctccccacc tctgaacaaa atgaacagca tgaacaagct 12 00 
gccttctgtg agccagctta tcaaccctca gcagcgcaac gccctcactc ctacaaccat 12 60 
tcctgatggc atgggagcca acattcccat gatgggcacc cacatgccaa tggctggaga 132 0 
catgaatgga ctcagcccca cccaggcact ccctccccca ctctccatgc catccacctc 1380 
ccactgcaca cccccacctc cgtatcccac agattgcagc attgtcagtt tcttagcgag 14 4 0 
gttgggctgt tcatcatgtc tggactattt cacgacccag gggctgacca ccatctatca 1500 
gattgagcat tactccatgg atgatctggc aagtctgaaa atccctgagc aatttcgaca 15 60 
tgcgatctgg aagggcatcc tggaccaccg gcagctccac gaattctcct ccccttctca 1620 
t.ctcctgcgg accccaagca gtgcctctac agtcagtgtg ggctccagtg agacccgggg 1680 
tgagcgtgtt attgatgctg tgcgattcac cctccgccag accatctctt tcccaccccg 1740 
agatgagtgg aatgacttca actttgacat ggatgctcgc cgcaataagc aacagcgcat 1800 
caaagaggag ggggagtgag cctcaccatg tgagctcttc ctatccctct cctaactgcc 18 60 
agccccctaa aagcactcct gcttaatctt caaagccttc tccctagctc ctccccttcc 1920 
tcttgtctga tttcttaggg gaaggagaag taagaggcta cctcttacct aacatctgac 1980 
ctggcatcta attctgattc tggctttaag ccttcaaaac tatagcttgc agaactgtag 2040 
ctgccatggc taggtagaag tgagcaaaaa agagttgggt gtctccttaa gctgcagaga 2100 
tttctcattg acttttataa agcatgttca cccttatagt ctaagactat atatataaat 2160 
gtataaatat acagtataga tttttgggtg gggggcattg agtattgttt aaaatgtaat 2220 
ttaaatgaaa gaaaattgag ttgcacttat tgaccatttt ttaatttact tgttttggat 22 8 0 
ggcttgtcta tactccttcc cttaaggggt atcatgtatg gtgataggta tctagagctt 2340 
aatgctacat gtgagtgoga tgatgtacag attctttcag ttctttggat tctaaataca 2 400 
tgccacatca aacctttgag tagatccatt tccattgctt attatgtagg taagactgta 24 60 
gatatgtatt cttttctcag tgttggtata ttttatatta ctgacatttc ttctagtgat 2520 
gatggttcac gttggggtga tttaatccag ttataagaag aagttcatgt ccaaacggtc 2580 
ctctttagtt tttggttggg aatgaggaaa attcttaaaa ggcccatagc agccagttca 2640 
aaaacacccg acgtcatgta tttgagcata tcagtaaccc ccttaaattt aatacccaga 2700 
taccttatct tacaatgttg attgggaaaa catttgctgc ccattacaga ggtattaaaa 27 60 
ctaaatttca ctactagatt gactaactca aatacacatt tgctactgtt gtaagaattc 2820 



<210> 332 

<211> 2270 

<212> DNA 

<213> Homo sapiens 



<400> 332 

tcgttgatat caaagacagt tgaaggaaat 
acagtactgc cctgaccctt acatccagcg 
aaagaaagtt attaccgatc caccatgtcc 
ccagaggttt tccagcatat ctgggatttt 
attgacttga actttgtgga tgaaccatca 
agcatggact gtatccgcat gcaggactcg 
acgaacctgg ggctcctgaa cagcatggac 
agtccctata acacagacca cgcgcagaac 
cccagctcca ccttcgatgc tctctctcca 
ccaggcccgc acagtttcga cgtgtccttc 
tggacgtatt ccactgaact gaagaaactc 



gaattttgaa acttcacggt gtgccaccct 60 
tttcgtagaa acccagctca tttctcttgg 12 0 
cagagcacac agacaaatga attcctcagt 18 0 
ctggaacagc ctatatgttc agttcagccc 240 
gaagatggtg cgacaaacaa gattgagatt 300 
gacctgagtg accccatgtg gccacagtac 360 
cagcagattc agaacggctc ctcgtccacc 42 0 
agcgtcacgg cgccctcgcc ctacgcacag 480 
tcacccgcca tcccctccaa caccgactac 54 0 
cagcagtcga gcaccgccaa gtcggccacc 600 
tactgccaaa ttgcaaagac atgccccatc 660 
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cagatcaagg tgatgacccc acctcctcag ggagctgtta tccgcgccat gcctgtctac 720 
aaaaaagctg agcacgtcac ggaggtggtg aagcggtgcc ccaaccatga gctgagccgt 78 0 
gaattcaacg agggacagat tgcccctcct agtcatttga ttcgagtaga ggggaacagc 840 
catgcccagt atgtagaaga tcccatcaca ggaagacaga gtgtgctggt accttatgag 900 
ccaccccagg ttggcactga attcacgaca gtcttgtaca atttcatgtg taacagcagt 960 
tgtgttggag ggatgaaccg ccgtccaatt ttaatcattg ttactctgga aaccagagat 1020 
gggcaagtcc tgggccgacg ctgctttgag gcccggatct gtgcttgccc aggaagagac 1080 
aggaaggcgg atgaagatag catcagaaag cagcaagttt cggacagtac aaagaacggt 1140 
gatggtacga agcgcccgtt tcgtcagaac acacatggta tccagatgac atccatcaag 1200 
aaacgaagat ccccagatga tgaactgtta tacttaccag tgaggggccg tgagacttat 12 60 
gaaatgctgt tgaagatcaa agagtccctg gaactcatgc agtaccttcc tcagcacaca 1320 
attgaaacgt acaggcaaca gcaacagcag cagcaccagc acttacttca gaaacagacc 138 0 
tcaatacagt ctccatcttc atatggtaac agctccccac ctctgaacaa aatgaacagc 1440 
atgaacaagc tgccttctgt gagccagctt atcaaccctc agcagcgcaa cgccctcact 15 00 
cctacaacca ttcctgatgg catgggagcc aacattccca tgatgggcac ccacatgcca 15 60 
atggctggag acatgaatgg actcagcccc acccaggcac tccctccccc actctccatg 1620 
ccatccacct cccactgcac acccccacct ccgtatccaa cagattgcag cattgtcggt 168 0 
ttcttagcga ggttgggctg ttcatcatgt ctggactatt tcacgaccca ggggctgacc 1740 
accatctatc agattgagca ttactccatg gatgatctgg caagtctgaa aatccctgag 1800 
caatttcgac atgcgatctg gaagggcatc ctggaccacc ggcagctcca cgaattctcc 1860 
tccccttctc atctcctgcg gaccccaagc agtgcctcta cagtcagtgt gggctccagt 1920 
gagacccggg gtgagcgtgt tattgatgct gtgcgattca ccctccgcca gaccatctct 1980 
ttcccacccc gagatgagtg gaatgacttc aactttgaca tggatgctcg ccgcaataag 2040 
caacagcgca tcaaagagga gggggagtga gcctcaccat gtgagctctt cctatccctc 2100 
tcctaactgc cagcccccta aaagcactcc tgcttaatct tcaaagcctt ctccctagct 2160 
cctccccttc ctcttgtctg atttcttagg ggaaggagaa gtaagaggct acctcttacc 2220 
taacatctga cctggcatct aattctgatt ctggctttaa gccttcaaaa 2270 

<210> 333 
<211> 2816 
<212> DNA 
<213> Homo sapiens 

<400> 333 

tcgttgatat caaagacagt tgaaggaaat gaattttgaa acttcacggt gtgccaccct 60 

acagtactgc cctgaccctt acatccagcg tttcgtagaa acccagctca tttctcttgg 12 0 

aaagaaagtt attaccgatc caccatgtcc cagagcacac agacaaatga attcctcagt 180 

ccagaggttt tccagcatat ctgggatttt ctggaacagc ctatatgttc agttcagccc 240 

attgacttga actttgtgga tgaaccatca gaagatggtg cgacaaacaa gattgagatt 300 

agcatggact gtatccgcat gcaggactcg gacctgagtg accccatgtg gccacagtac 360 

acgaacctgg ggctcctgaa cagcatggac cagcagattc agaacggctc ctcgtccacc 42 0 

agtccctata acacagacca cgcgcagaac agcgtcacgg cgccctcgcc ctacgcacag 480 

cccagctcca ccttcgatgc tctctctcca tcacccgcca tcccctccaa caccgactac 540 

ccaggcccgc acagtttcga cgtgtccttc cagcagtcga gcaccgccaa gtcggccacc 600 

tggacgtatt ccactgaact gaagaaactc tactgccaaa ttgcaaagac atgccccatc 660 

cagatcaagg tgatgacccc acctcctcag ggagctgtta tccgcgccat gcctgtctac 72 0 

aaaaaagctg agcacgtcac ggaggtggtg aagcggtgcc ccaaccatga gctgagccgt 78 0 

gaattcaacg agggacagat tgcccctcct agtcatttga ttcgagtaga ggggaacagc 840 

catgcccagt atgtagaaga tcccatcaca ggaagacaga gtgtgctggt accttatgag 900 

ccaccccagg ttggcactga attcacgaca gtcttgtaca atttcatgtg taacagcagt 960 

tgtgttggag ggatgaaccg ccgtccaatt ttaatcattg ttactctgga aaccagagat 1020 

gggcaagtcc tgggccgacg ctgctttgag gcccggatct gtgcttgccc aggaagagac 1080 

aggaaggcgg atgaagatag catcagaaag cagcaagttt cggacagtac aaagaacggt 114 0 

gatggtacga agcgcccgtt tcgtcagaac acacatggta tccagatgac atccatcaag 1200 

aaacgaagat ccccagatga tgaactgtta tacttaccag tgaggggccg tgagacttat 12 60 

gaaatgctgt tgaagatcaa agagtccctg gaactcatgc agtaccttcc tcagcacaca 1320 

attgaaacgt acaggcaaca gcaacagcag cagcaccagc acttacttca gaaacatctc 1380 

ctttcagcct gcttcaggaa tgagcttgtg gagccccgga gagaaactcc aaaacaatct 14 4 0 
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gacgtcttct ttagacattc caagccccca 
tctatatttt aagtgtgtgt gttgtatttc 
tgtgtgtgcg tgtgtatcta gccctcataa 
cccaactgct caaaggcaca aagccactag 
ttacaagaaa ggatgttttc tgcagatttt 
gaaccactgt gtttgtctgt gagctttctg 
gaaaggggca ttaagatgtt tattggaacc 
aattcacagg gaagcttttg agcaggtctc 
aaaaaagttg ttattgtctg tgcataagta 
cccttttaat gctggtcatg taataatatt 
tactgctggg cagcgaggtg atcattacca 
tttgtgagaa cttgcattat ttgtgtcctc 
gctgtgtacc tgcctctgcc actgtatgtt 
catgaaaccc tggaagacct actacaaaaa 
ctcattttgt gcttttaata gaaagacaaa 
tgtttaccat tattcaaagc tcaaaataga 
aatttgctta attagagctt ctatccctca 
ctgatactgt tcagtgcatt tagccaggag 
agacgtgtta aaatcagcac tcctggactg 
ttcttttttt tactcaaaag tttagagaat 
ttaagataat agcataaaga ctttaaaaat 
caccagcact gtattttctg tcaccaagac 
ttgtggatgt gtgattttaa ttttcaataa 

<210> 334 
<211> 2082 
<212> DNA 

<213> Homo sapiens 



aaccgatcag tgtacccata gagccctatc 1500 
catgtgtata tgtgagtgtg tgtgtgtgta 1560 
acaggacttg aagacacttt ggctcagaga 1620 
tgagagaatc ttttgaaggg actcaaacct 1680 
gtatccttag accggccatt ggtgggtgag 17 40 
ttgtttcctg ggagggaggg gtcaggtggg 18 00 
cttttctgtc ttcttctgtt gtttttctaa 18 60 
aaacttaaga tgtcttttta agaaaaggag 1920 
agttgtaggt gactgagaga ctcagtcaga 198 0 
gcaagtagta agaaacgaag gtgtcaagtg 2040 
aaagtaatca actttgtggg tggagagttc 2100 
ccctcatgtg taggtagaac atttcttaat 2160 
ggcatctgtt atgctaaagt ttttcttgta 2220 
aactgttgtt tggcccccat agcaggtgaa 22 8 0 
tccaccccag taatattgcc cttacgtagt 2340 
atttgaagcc ctctcacaaa atctgtgatt 2400 
agcctaccta ccataaaacc agccatatta 24 60 
acttacgttt tgagtaagtg agatccaagc 252 0 
gaaattaaag attgaaaggg tagactactt 2580 
ctctgtttct ttccatttta aaaacatatt 2 64 0 
gttcctcccc tccatcttcc cacacccagt 2700 
aatgatttct tgttattgag gctgttgctt 2760 
acttttgcat cttggtttaa aagaaa 2816 



<400> 334 

agatgctaca gcgactgcac acccaggctg 
aacctgtcca gcatgtgatg tggtgggata 
tgtaacacag tggtaagtct ttgtgtatct 
agaatatggt attataatct tatggaacta 
gtagttatac agcacaggac tgtgcttatg 
cctttaatct tcatatcaac cctaggaggt 
tctcggggtg ggggggttgg caaaatcctg 
aatcttacag ctaacatgtt gtacctggaa 
cagtacacga acctggggct cctgaacagc 
tccaccagtc cctataacac agaccacgcg 
gcacagccca gctccacctt cgatgctctc 
gactacccag gcccgcacag tttcgacgtg 
gccacctgga cgtattccac tgaactgaag 
cccatccaga tcaaggtgat gaccccacct 
gtctacaaaa aagctgagca cgtcacggag 
agccgtgaat tcaacgaggg acagattgcc 
aacagccatg cccagtatgt agaagatccc 
tatgagccac cccaggttgg cactgaattc 
agcagttgtg ttggagggat gaaccgccgt 
agagatgggc aagtcctggg ccgacgctgc 
agagacagga aggcggatga agatagcatc 
aacggtgatg gtacgaagcg cccgtctcgt 
atcaagaaac gaagatcccc agatgatgaa 
acttatgaaa tgctgttgaa gatcaaagag 
cacacaattg aaacgtacag gcaacagcaa 
cagtgagtgt atcaacgtgt cattttagga 
agcaataggg tgattgatga gcaatgtgga 
ttcagatgac ctggtatggc aaccctcttt 



tatgatacag cctattgctc ccgggctgca 60 

ctgaattgaa taccgaatac tgtaggcaat 120 

aaacatagct aaacaccaaa aggtatagta 180 

tcattgtata tgtggtttgt caaccagaat 240 

atgtgccaag cacagctctc agtactaact 300 

aacttcttaa gtagattcat attgtaaggg 360 

gagccagaag aaaggacagc agcattgatc 420 

aacaatgccc agactcaatt tagtgagcca 480 

atggaccagc agattcagaa cggctcctcg 540 

cagaacagcg tcacggcgcc ctcgccctac 600 

tctccatcac ccgccatccc ctccaacacc 660 

tccttccagc agtcgagcac cgccaagtcg 720 

aaactctact gccaaattgc aaagacatgc 780 

cctcagggag ctgttatccg cgccatgcct 84 0 

gtggtgaagc ggtgccccaa ccatgagctg 900 

cctcctagtc atttgattcg agtagagggg 960 

atcacaggaa gacagagtgt gctggtacct 1020 

acgacagtct tgtacaattt catgtgtaac 1080 

ccaattttaa tcattgttac tctggaaacc 1140 

tttgaggccc ggatctgtgc ttgcccagga 1200 

agaaagcagc aagtttcgga cagtacaaag 12 60 

cagaacacac atggtatcca gatgacatcc 132 0 

ctgttatact taccagtgag gggccgtgag 1380 

tccctggaac tcatgcagta ccttcctcag 1440 

cagcagcagc accagcactt acttcagaaa 1500 

ggcatgagtg acggtgactt tatttggatc 1560 

acataatggg agatagcaga ttgtcataga 1620 

cagttgcaac cttttttacg tgtcttatta 1680 
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taaccttccc ttcagaattc cacttatgtt ctgaaattaa atacaaacca tttctggtga 1740 

attacaaaga aactcacact aacagttctc ttctctatat gcctggtcca tacacactaa 1800 

cagtaagtac acactctatt tggtagtgat gtgtatattt gaaaacatga aatcttttct 18 60 

catcccaatg gattgtctta taaatctcct gggatgcaca ctatccactt ttgggaataa 1920 

cactgtagac cagggatagc aaataggctt tactataata taaagtgact tgtttgaatg 1980 

ctgtaatgag aagaattctg agacctagtg catgataatt ggggaaatat ctgggtgcag 2 040 

aaggataagg tagcatcatg ttgccgtatt ttagcatctc tg 2082 

<210> 335 
<211> 4849 
<212> DNA 
<213> Homo sapiens 

<400> 335 

cgttgatatc aaagacagtt gaaggaaatg aattttgaaa cttcacggtg tgccacccta 60 
cagtactgcc ctgaccctta catccagcgt ttcgtagaaa ccccagctca tttctcttgg 120 
aaagaaagtt attaccgatc caccatgtcc cagagcacac agacaaatga attcctcagt 180 
ccagaggttt tccagcatat ctgggatttt ctggaacagc ctatatgttc agttcagccc 240 
attgacttga actttgtgga tgaaccatca gaagatggtg cgacaaacaa gattgagatt 300 
agcatggact gtatccgcat gcaggactcg gacctgagtg accccatgtg gccacagtac 360 
acgaacctgg ggctcctgaa cagcatggac cagcagattc agaacggctc ctcgtccacc 420 
agtccctata acacagacca cgcgcagaac agcgtcacgg cgccctcgcc ctacgcacag 480 
cccagctcca ccttcgatgc tctctctcca tcacccgcca tcccctccaa caccgactac 540 
ccaggcccgc acagtttcga cgtgtccttc cagcagtcga gcaccgccaa gtcggccacc 600 
tggacgtatt ccactgaact gaagaaactc tactgccaaa ttgcaaagac atgccccatc 660 
cagatcaagg tgatgacccc acctcctcag ggagctgtta tccgcgccat gcctgtctac 720 
aaaaaagctg agcacgtcac ggaggtggtg aagcggtgcc ccaaccatga gctgagccgt 780 
gaattcaacg agggacagat tgcccctcct agtcatttga ttcgagtaga ggggaacagc 840 
catgcccagt atgtagaaga tcccatcaca ggaagacaga gtgtgctggt accttatgag 900 
ccaccccagg ttggcactga attcacgaca gtcttgtaca atttcatgtg taacagcagt 960 
tgtgttggag ggatgaaccg ccgtccaatt ttaatcattg ttactctgga aaccagagat 1020 
gggcaagtcc tgggccgacg ctgctttgag gcccggatct gtgettgccc aggaagagac 1080 
aggaaggcgg atgaagatag catcagaaag cagcaagttt cggacagtac aaagaacggt 1140 
gatggtacga agcgcccgtt tcgtcagaac acacatggta tccagatgac atccatcaag 1200 
aaacgaagat ccccagatga tgaactgtta tacttaccag tgaggggccg tgagacttat 1260 
gaaatgctgt tgaagatcaa agagtccctg gaactcatgc agtaccttcc tcagcacaca 1320 
attgaaacgt acaggcaaca gcaacagcag cagcaccagc acttacttca gaaacagacc 1380 
tcaatacagt ctccatcttc atatggtaac agctccccac ctctgaacaa aatgaacagc 1440 
atgaacaagc tgccttctgt gagccagctt atcaaccctc agcagcgcaa cgccctcact 1500 
cctacaacca ttcctgatgg catgggagcc aacattccca tgatgggcac ccacatgcca 1560 
atggctggag acatgaatgg actcagcccc acccaggcac tccctccccc actctccatg 1620 
ccatccacct cccagtgcac acccccacct ccgtatccca cagattgcag cattgtcagt 1680 
ttcttagcga ggttgggctg ttcatcatgt ctggactatt tcacgaccca ggggctgacc 1740 
accatctatc agattgagca ttactccatg gatgatctgg caagtctgaa aatccctgag 1800 
caatttcgac atgcgatctg gaagggcatc ctggaccacc ggcagctcca cgaattctcc 18 60 
tccccttctc atctcctgcg gaccccaagc agtgcctcta cagtcagtgt gggctccagt 1920 
gagacccggg gtgagcgtgt tattgatgct gtgcgattca ccctccgcca gaccatctct 1980 
ttcccacccc gagatgagtg gaatgacttc aactttgaca tggatgctcg ccgcaataag 2 04 0 
caacagcgca tcaaagagga gggggagtga gcctcaccat gtgagctctt cctatccctc 2100 
tcctaactgc cagcycccta aaagcactcc tgcttaatct tcaaagcctt ctccctagct 2160 
cctccccttc ctcttgtctg atttcttagg ggaaggagaa gtaagaggct acctcttacc 2220 
taacatctga cctggcatct aattctgatt ctggctttaa gccttcaaaa ctatagcttg 2280 
cagaactgta gctgccatgg ctaggtagaa gtgagcaaaa aagagttggg tgtctcctta 2340 
agctgcagag atttctcatt gacttttata aagcatgttc acccttatag tctaagacta 2400 
tatatataaa tgtataaata tacagtatag atttttgggt ggggggcatt gagtattgtt 2460 
taaaatgtaa tttaaatgaa agaaaattga gttgcactta ttgaccattt tttaatttac 2520 
ttgttttgga tggcttgtct atactccttc ccttaagggg tatcatgtat ggtgataggt 258 0 
atctagagct taatgctaca tgtgagtgac gatgatgtac agattctttc agttctttgg 2 640 
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attctaaata catgccacat caaacctttg agtagatcca tttccattgc ttattatgta 27 00 

ggtaagactg tagatatgta ttcttttctc agtgttggta tattttatat tactgacatt 27 60 

tcttctagtg atgatggttc acgttggggt gatttaatcc agttataaga agaagttcat 2820 

gtccaaacgt cctctttagt ttttggttgg gaatgaggaa aattcttaaa aggcccatag 28 8 0 

cagccagttc aaaaacaccc gacgtcatgt atttgagcat atcagtaacc cccttaaatt 2940 

taataccaga taccttatct tacaatattg attgggaaaa catttgctgc cattacagag 3000 

gtattaaaac taaatttcac tactagattg actaactcaa atacacattt gctactgttg 3060 

taagaattct gattgatttg attgggatga atgccatcta tctagttcta acagtgaagt 3120 

tttactgtct attaatattc agggtaaata ggaatcattc agaaatgttg agtctgtact 3180 

aaacagtaag atatctcaat gaaccataaa ttcaactttg taaaaatctt ttgaagcata 32 40 

gataatattg tttggtaaat gtttcttttg tttggtaaat gtttctttta aagaccctcc 3300 

tattctataa aactctgcat gtagaggctt gtttaccttt ctctctctaa ggtttacaat 33 60 

aggagtggtg atttgaaaaa tataaaatta tgagattggt tttcctgtgg cataaattgc 3420 

atcactgtat cattttcttt tttaaccggt aagagtttca gtttgttgga aagtaactgt 34 80 

gagaacccag tttcccgtcc atctccctta gggactaccc atagacatga aaggtcccca 3540 

cagagcaaga gataagtctt tcatggctgc tgttgcttaa accacttaaa cgaagagttc 3600 

ccttgaaact ttgggaaaac atgttaatga caatattcca gatctttcag aaatataaca 3660 

catttttttg catgcatgca aatgagctct gaaatcttcc catgcattct ggtcaagggc 3720 

tgtcattgca cataagcttc cattttaatt ttaaagtgca aaagggccag cgtggctcta 37 80 

aaaggtaatg tgtggattgc ctctgaaaag tgtgtatata ttttgtgtga aattgcatac 38 4 0 

tttgtatttt gattattttt tttttcttct tgggatagtg ggatttccag aaccacactt 3900 

gaaacctttt tttatcgttt ttgtattttc atgaaaatac catttagtaa gaataccaca 3960 

tcaaataaga aataatgcta caattttaag aggggaggga agggaaagtt tttttttatt 4020 

atttttttaa aattttgtat gttaaagaga atgagtcctt gatttcaaag ttttgttgta 4080 

cttaaatggt aataagcact gtaaacttct gcaacaagca tgcagctttg caaacccatt 4140 

aaggggaaga atgaaagctg ttccttggtc ctagtaagaa gacaaactgc ttcccttact 4200 

ttgctgaggg tttgaataaa cctaggactt ccgagctatg tcagtactat tcaggtaaca 42 60 

ctagggcctt ggaaattcct gtactgtgtc tcatggattt ggcactagcc aaagcgaggc 4320 

acccttactg gcttacctcc tcatggcagc ctactctcct tgagtgtatg agtagccagg 4380 

gtaaggggta aaaggatagt aagcatagaa accactagaa agtgggctta atggagttct 44 40 

tgtggcctca gctcaatgca gttagctgaa gaattgaaaa gtttttgttt ggagacgttt 4500 

ataaacagaa atggaaagca gagttttcat taaatccttt tacctttttt ttttcttggt 45 60 

aatcccctaa aataacagta tgtgggatat tgaatgttaa agggatattt tttttctatt 4620 

atttttataa ttgtacaaaa ttaagcaaat gttaaaagtt ttatatgctt tattaatgtt 4680 

ttcaaaaggt attatacatg tgatacattt tttaagcttc agttgcttgt cttctggtac 47 4 0 

tttctgttat gggcttttgg ggagccagaa gccaatctac aatctctttt tgtttgccag 4800 

gacatgcaat aaaatttaaa aaataaataa aaactaatta agaaataaa 4849 

<210> 336 
<211> 1386 
<212> DNA 
<213> Homo sapiens 

<400> 336 

atgttgtacc tggaaaacaa tgcccagact caatttagtg agccacagta cacgaacctg 60 
gggctcctga acagcatgga ccagcagatt cagaacggct cctcgtccac cagtccctat 12 0 
aacacagacc acgcgcagaa cagcgtcacg gcgccctcgc cctacgcaca gcccagctcc 18 0 
accttcgatg ctctctctcc atcacccgcc atcccctcca acaccgacta cccaggcccg 24 0 
cacagtttcg acgtgtcctt ccagcagtcg agcaccgcca agtcggccac ctggacgtat 300 
tccactgaac tgaagaaact ctactgccaa attgcaaaga catgccccat ccagatcaag 360 
gtgatgaccc cacctcctca gggagctgtt atccgcgcca tgcctgtcta caaaaaagct 42 0 
gagcacgtca cggaggtggt gaagcggtgc cccaaccatg agctgagccg tgaattcaac 480 
gagggacaga ttgcccctcc tagtcatttg attcgagtag aggggaacag ccatgcccag 54 0 
tatgtagaag atcccatcac aggaagacag agtgtgctgg taccttatga gccaccccag 60 0 
gttggcactg aattcacgac agtcttgtac aatttcatgt gtaacagcag ttgtgttgga 660 
gggatgaacc gccgtccaat tttaatcatt gttactctgg aaaccagaga tgggcaagtc 72 0 
ctgggccgac gctgctttga ggcccggatc tgtgcttgcc caggaagaga caggaaggcg 78 0 
gatgaagata gcatcagaaa gcagcaagtt tcggacagta caaagaacgg tgatggtacg 84 0 
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aagcgcccgt ttcgtcagaa cacacatggt 
tccccagatg atgaactgtt atacttacca 
ttgaagatca aagagtccct ggaactcatg 
tacaggcaac agcaacagca gcagcaccag 
tctccatctt catatggtaa cagctcccca 
ctgccttctg tgagccagct tatcaaccct 
attcctgatg gcatgggagc caacattccc 
gacatgaatg gactcagccc cacccaggca 
tcccactgca cacccccacc tccgtatccc 
gtctga 

<210> 337 
<211> 1551 
<212> DNA 
<213> Homo sapiens 



atccagatga catccatcaa gaaacgaaga 900 

gtgaggggcc gtgagactta tgaaatgctg 960 

cagtaccttc ctcagcacac aattgaaacg 1020 

cacttacttc agaaacagac ctcaatacag 1080 

cctctgaaca aaatgaacag catgaacaag 1140 

cagcagcgca acgccctcac tcctacaacc 12 00 

atgatgggca cccacatgcc aatggctgga 1260 

ctccctcccc cactctccat gccatccacc 1320 

acagattgca gcattgtcag gatctggcaa 1380 
1386 



<400> 337 

atgtcccaga gcacacagac aaatgaattc ctcagtccag aggttttcca gcatatctgg 60 
gattttctgg aacagcctat atgttcagtt cagcccattg acttgaactt tgtggatgaa 12 0 
ccatcagaag atggtgcgac aaacaagatt gagattagca tggactgtat ccgcatgcag 18 0 
gactcggacc tgagtgaccc catgtggcca cagtacacga acctggggct cctgaacagc 240 
atggaccagc agattcagaa cggctcctcg tccaccagtc cctataacac agaccacgcg 300 
cagaacagcg tcacggcgcc ctcgccctac gcacagccca gctccacctt cgatgctctc 360 
tctccatcac ccgccatccc ctccaacacc gactacccag gcccgcacag tttcgacgtg 420 
tccttccagc agtcgagcac cgccaagtcg gccacctgga cgtattccac tgaactgaag 480 
aaactctact gccaaattgc aaagacatgc cccatccaga tcaaggtgat gaccccacct 54 0 
cctcagggag ctgttatccg cgccatgcct gtctacaaaa aagctgagca cgtcacggag 600 
gtggtgaagc ggtgccccaa ccatgagctg agccgtgaat tcaacgaggg acagattgcc 660 
cctcctagtc atttgattcg agtagagggg aacagccatg cccagtatgt agaagatccc 720 
atcacaggaa gacagagtgt gctggtacct tatgagccac cccaggttgg cactgaattc 780 
acgacagtct tgtacaattt catgtgtaac agcagttgtg ttggagggat gaaccgccgt 840 
ccaattttaa tcattgttac tctggaaacc agagatgggc aagtcctggg ccgacgctgc 900 
tttgaggccc ggatctgtgc ttgcccagga agagacagga aggcggatga agatagcatc 960 
agaaagcagc aagtttcgga cagtacaaag aacggtgatg gtacgaagcg cccgtttcgt 1020 
cagaacacac atggtatcca gatgacatcc atcaagaaac gaagatcccc agatgatgaa 1080 
ctgttatact taccagtgag gggccgtgag acttatgaaa tgctgttgaa gatcaaagag 1140 
tccctggaac tcatgcagta ccttcctcag cacacaattg aaacgtacag gcaacagcaa 1200 
cagcagcagc accagcactt acttcagaaa cagacctcaa tacagtctcc atcttcatat 12 60 
ggtaacagct ccccacctct gaacaaaatg aacagcatga acaagctgcc ttctgtgagc 1320 
cagcttatca accctcagca gcgcaacgcc ctcactccta caaccattcc tgatggcatg 1380 
ggagccaaca ttcccatgat gggcacccac atgccaatgg ctggagacat gaatggactc 14 4 0 
agccccaccc aggcactccc tcccccactc tccatgccat ccacctccca ctgcacaccc 1500 
ccacctccgt atcccacaga ttgcagcatt gtcaggatct ggcaagtctg a 1551 

<210> 338 
<211> 586 
<212> PRT 

<213> Homo sapiens 
<400> 338 

Met Leu Tyr Leu Glu Asn Asn Ala Gin Thr Gin Phe Ser Glu Pro Gin 

15 10 15 

Tyr Thr Asn Leu Gly Leu Leu Asn Ser Met Asp Gin Gin lie Arg Asn 

20 25 30 

Gly Ser Ser Ser Thr Ser Pro Tyr Asn Thr Asp His Ala Gin Asn Ser 

35 40 45 

Val Thr Ala Pro Ser Pro Tyr Ala Gin Pro Ser Pro Thr Phe Asp Ala 
50 55 60 
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Leu 


Ser 


Pro 


Ser 


Pro 


Ala 


He 


Pro 


Ser 


Asn 


Thr 


Aso 


Tvr 


Pro 


Gly 


Pro 


65 










70 










75 










80 


His 


Ser 


Ser 


Asp 


Val 




Phe 


Gin 


Gin 


Ser 




Thr 


Ala 


Lys 


Ser 


Ala 










85 










90 










95 




Thr 


Trp 


Thr 


Tyr 


Ser 


Thr 


Glu 


Leu 


Lys 


Lys 


Leu 


Tvr 


Cys 


Gin 


He 


Ala 








100 










105 










110 






Lys 


Thr 


Cys 


Pro 


He 


Gin 


He 


Lys 


Val 


Met 


Thr 


Pro 


Pro 


Pro 


Gin 


Gly 






115 










120 










125 








Ala 


Val 


He 


Arg 


Ala 


Met 


Pro 


Val 


Tvr 


Lys 


Lys 


Ala 


Glu 


His 


Val 


Thr 




130 










135 










140 










Glu 


Val 


Val 


Lys 


Arg 


Cys 


Pro 


Asn 


His 


Glu 


Leu 


Ser 


Arg 


Glu 


Phe 


Asn 


145 










150 










155 










160 


Glu 


Gly 


Gin 


He 


Ala 


Pro 


Pro 


Ser 


His 


Leu 


He 


Arg 


Val 


Glu 


Gly 












165 










170 










175 




Ser 


His 


Ala 


Gin 


Tyr 


Val 


Glu 


Asp 


Pro 


He 


Thr 


Gly 


Arg 


Gin 


Ser 


Val 








180 










185 










190 






Leu 


Val 


Pro 


Tyr 


Glu 


Pro 


Pro 


Gin 


Val 


Gly 


Thr 


Glu 


Phe 


Thr 


Thr 


Val 






195 










200 








205 








Leu 


Tyr 




Phe 


Met 


Cys 


Asn 


Ser 


Ser 


Cys 


Val 


Gly 


Gly 


Met 


Asn 


Arg 




210 










215 










220 










Arg 




He 


Leu 


He 


He 


Val 


Thr 


Leu 


Glu 


Thr 


Arg 


Asp 


Gly 


Gin 


Val 


225 










230 










235 










240 




Gly 


A 

rg 


rg 


Cys 


Phe 


Glu 


Ala 


Arg 


He 


Cys 


Ala 


Cvs 


Pro 


Gly 


Arg 










24 5 










250 










255 




sp 


- 

. rg 


L 


Ala 


Asp 


Glu 


sp 


Ser 


He 


Arg 


Lvs 
ys 


Gin 


Gin 


Val 




Aso 








260 










265 










270 






Ser 


Thr 


Lys 




Gly 


Asp 


Gly 


Thr 


Lys 


Arg 


Pro 


Phe 


Arg 


Gin 


Asn 


Thr 






275 










280 










285 








His 


Gly 


He 


Gin 


Met 


Thr 


Ser 


He 


Lys 


Lys 


Arg 


Arg 


Ser 


Pro 


Asp 


Asp 




290 










295 










300 










Glu 




Leu 


Tyr 


Leu 


Pro 


Val 


Arg 


Gly 


Arg 


Glu 


Thr 


Tyr 


Glu 


Met 


Leu 


305 










310 










315 










320 




Lys 


He 


Lys 


Glu 


Ser 


Leu 


Glu 


Leu 


Met 


Gin 


Tyr 


Leu 


Pro 


Gin 


His 










325 










330 










335 




Thr 


lie 


Glu 


Thr 


Tyr 


Arg 


Gin 


Gin 


Gin 


Gin 


Gin 


Gin 


His 


Gin 


His 


Leu 








340 










345 










350 






Leu 


Gin 


Lys 


Gin 


Thr 


Ser 


He 


Gin 


Ser 


Pro 


Ser 


Ser 


Tyr 


Gly 


Asn 


Ser 






355 










360 










365 








Ser 


Pro 


Pro 


Leu 


Asn 


Lys 


Met 


Asn 


Ser 


Met 


Asn 


Lys 




Pro 


Ser 


Val 




370 








375 










380 










Ser 


Gin 


Leu 


He 


Asn 


Pro 


Gin 


Gin 


Arg 


Asn 


Ala 


Leu 


Thr 


Pro 


Thr 


Thr 


385 










390 










395 










400 


lie 


Pro 


Asp 


Gly 


Met 


Gly 


Ala 


Asn 


He 


Pro 


Met 


Met 


Gly 


Thr 


His 


Met 










405 










410 










415 




Pro 


Met 


Ala 


Gly 


Asp 


Met 


Asn 


Gly 


Leu 


Ser 


Pro 


Thr 


Gin 


Ala 


Leu 


Pro 








420 










425 










430 






Pro 


Pro 


Leu 




Met 


Pro 


Ser 


Thr 


Ser 


His 


Cvs 


Thr 


Pro 


Pro 


Pro 


Pro 






435 










440 










445 








T 


Pro 




Asd 


Cvs 


Ser 


He 


Val 


Ser 


Phe 


Leu 


Ala 


Arg 


Leu 


Gly 


Cvs 




450 










455 










460 










Ser 


Ser 


Cys 


Leu 


Asp 


Tyr 


Phe 


Thr 


Thr 


Gin 


Gly 


Leu 


Thr 


Thr 


He 


Tyr 


465 










470 










475 










480 


Gin 


He 


Glu 


His 


Tyr 


Ser 


Met 


Asp 


Asp 


Leu 


Ala 


Ser 


Leu 


Lys 


He 


Pro 










485 










490 










495 




Glu 


Gin 


Phe 


Arg 


His 


Ala 


He 


Trp 


Lys 


Gly 


He 


Leu 


Asp 


His 


Arg 


Gin 








500 










505 










510 






Leu 


His 


Glu 


Phe 


Ser 


Ser 


Pro 


Ser 


His 


Leu 


Leu 


Arg 


Thr 


Pro 


Ser 


Ser 






515 










520 










525 
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Ala Ser Thr Val Ser Val Gly Ser 

530 535 
He Asp Ala Val Arg Phe Thr Leu 
545 550 
Arg Asp Glu Trp Asn Asp Phe Asn 
565 

Lys Gin Gin Arg He Lys Glu Glu 
580 



Ser Glu Thr Arg Gly Glu Arg Val 
540 

Arg Gin Thr He Ser Phe Pro Pro 
555 560 
Phe Asp Met Asp Ala Arg Arg Asn 

570 575 
Gly Glu 
585 



<210> 339 
<211> 641 
<212> PRT 

<213> Homo sapiens 



<400> 339 



Met 


Ser 


Gin 


Ser 


Thr 


Gin 


Thr 


Asn 


Glu 


Phe 


Leu 


Ser 


Pro 


Glu 


Val 


Phe 


1 








5 










10 










15 




Gin 


His 


He 


Trp 
20 


Asp 


Phe 


Leu 


Glu 


Gin 
25 


Pro 


He 


Cys 


Ser 


Val 
30 


Gin 


Pro 


He 


Asp 


Leu 
35 


Asn 


Phe 


Val 


Asp 


Glu 
40 


Pro 


Ser 


Glu 


Asp 


Gly 
45 


Ala 


Thr 


Asn 


Lys 


He 
50 


Glu 


He 


Ser 


Met 


Asp 
55 


Cys 


He 


Arg 


Met 


Gin 
60 


Asp 


Ser 


Asp 


Leu 


Ser 


Asp 


Pro 


Met 


Trp 


Pro 


Gin 


Tyr 


Thr 


Asn 


Leu 


Gly 


Leu 


Leu 


Asn 


Ser 


65 










70 










75 










80 


Met 


Asp 


Gin 


Gin 


He 
85 


Gin 


Asn 


Gly 


Ser 


Ser 
90 


Ser 


Thr 


Ser 


Pro 


Tyr 
95 


Asn 


Thr 


Asp 


His 


Ala 
100 


Gin 


Asn 


Ser 


Val 


Thr 
105 


Ala 


Pro 


Ser 


Pro 


Tyr 
110 


Ala 


Gin 


Pro 


Ser 


Ser 


Thr 


Phe Asp 


Ala 


Leu 


Ser 


Pro 


Ser 


Pro 


Ala 


He 


Pro 


Ser 






115 










120 










125 








Asn 


Thr 


Asp 


Tyr 


Pro Gly 


Pro 


His 


Ser 


Phe 


Asp 


Val 


Ser 


Phe 


Gin 


Gin 




130 










135 










140 










Ser 


Ser 


Thr 


Ala 


Lys 


Ser 


Ala 


Thr 


Trp 


Thr 


Tyr 


Ser 


Thr 


Glu 


Leu 


Lys 


145 










150 










155 










160 


Lys 


Leu 


Tyr 


Cys 


Gin 
165 


He 


Ala 


Lys 


Thr 


Cys 
170 


Pro 


He 


Gin 


He 


Lys 
175 


Val 


Met 


Thr 


Pro 


Pro 
180 


Pro 


Gin 


Gly 


Ala 


Val 
185 


He 


Arg 


Ala 


Met 


Pro 
190 


Val 


Tyr 


Lys 


Lys 


Ala 
195 


Glu 


His 


Val 


Thr 


Glu 
200 


Val 


Val 


Lys 


Arg 


Cys 
205 


Pro 


Asn 


His 


Glu 


Leu 
210 


Ser 


Arg 


Glu 


Phe 


Asn 
215 


Glu 


Gly 


Gin 


He 


Ala 
220 


Pro 


Pro 


Ser 


His 


Leu 


lie 


Arg 


Val 


Glu 


Gly 


Asn 


Ser 


His 


Ala 


Gin 


Tyr 


Val 


Glu 


Asp 


Pro 


225 










230 










235 










240 


He 


Thr 


Gly 


Arg 


Gin 
245 


Ser 


Val 


Leu 


Val 


Pro 
250 


Tyr 


Glu 


Pro 


Pro 


Gin 
255 


Val 


Gly 


Thr 


Glu 


Phe 
260 


Thr 


Thr 


Val 


Leu 


Tyr 
265 


Asn 


Phe 


Met 


Cys 


Asn 
270 


Ser 


Ser 


Cys 


Val 


Gly 


Gly 


Met 


Asn 


Arg Arg 


Pro 


He 


Leu 


He 


He 


Val 


Thr 


Leu 






275 










280 










285 








Glu 


Thr 
290 


Arg 


Asp 


Gly 


Gin 


Val 
295 


Leu 


Gly 




Arg 


Cys 
300 


Phe 


Glu 


Ala 


Arg 


He 


Cys 


Ala 


Cys 


Pro 


Gly 


Arg 


Asp 


Arg 


Lys 


Ala 


Asp 


Glu 


Asp 


Ser 


He 


305 










310 










315 










320 


Arg 


Lys 


Gin 


Gin 


Val 
325 


Ser 


Asp 


Ser 


Thr 


Lys 
330 


Asn 


Gly 


Asp 


Gly 


Thr 
335 


Lys 
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Arg 


Pro 


Phe Arg 
340 


Gin 


Asn 


Thr 


His 


Gly 
345 


He 


Gin 


Met 


Thr 


Ser 
350 


He 


Lys 


Lys 


Arg 


Arg Ser 
355 


Pro 


Asp 


Asp 


Glu 
360 


Leu 


Leu 


Tyr 


Leu 


Pro 
365 


Val 


Arg 


Gly 


Arg 


Glu 

370 


Thr Tyr 


Glu 


Met 


Leu 
375 




Lys 


He 


Lys 


Glu 
380 


Ser 


Leu 


Glu 


Leu 


Met 


Gin 


Tyr Leu 


Pro 


Gin 


His 


Thr 


He 


Glu 


Thr 


Tyr 


Arg 


Gin 


Gin 


Gin 


385 








390 










395 










400 


Gin 


Gin 


Gin His 


Gin 
405 


His 




Leu 


Gin 


410 


Gin 


Thr 


Ser 


He 


Gin 
415 


Ser 


Pro 


Ser 


Ser Tyr 


Gly Asn 


Ser 


Ser 


Pro 


Pro 


Leu 


Asn 


Lys 


Met 


Asn 


Ser 






420 










425 










430 






Met 




Lys Leu 
435 


Pro 


Ser 


Val 


Ser 
440 


Gin 




He 


Asn 


Pro 
445 


Gin 


Gin 


Arg 


Asn 


Ala 

450 


Leu Thr 


Pro 


Thr 


Thr 
455 


He 


Pro 


Asp 


Gly 


Met 
460 


Gly 


Ala 


Asn 


He 


Pro 


Met 


Met Gly 


Thr 


His 


Met 


Pro 


Met 


Ala 


Gly 


Asp 


Met 


Asn 


Gly 


Leu 


465 








470 










475 










480 


Ser 


Pro 


Thr Gin 


Ala 
485 


Leu 


Pro 


Pro 


Pro 


490 


Ser 


Met 


Pro 


Ser 


Thr 
495 


Ser 


His 


Cys 


Thr Pro 
500 


Pro 


Pro 


Pro 


Tyr 


Pro 
505 


Thr 


Asp 


Cys 


Ser 


He 
510 


Val 


Gly 


Phe 




Ala Arg 
515 


Leu 


Gly 




Ser 
520 










525 


Phe 


Thr 


Thr 


Gin 


Gly 
530 


Leu Thr 


Thr 


He 


535 


Gin 


lie 


Glu 


His 


540 


Ser 


Met 








Ala 


Ser Leu 


Lys 


He 


Pro 


Glu 


Gin 


Phe 




His 


Ala 


He 


Trp 


Lys 


545 








550 










555 










560 


Gly 


lie 


Leu Asp 


His 
565 


Arg 


Gin 


Leu 


His 


Glu 
570 


Phe 


Ser 


Ser 


Pro 


Ser 
575 


His 


Leu 


Leu 


Arg Thr 
580 


Pro 


Ser 


Ser 


Ala 


Ser 
585 


Thr 


Val 


Ser 


Val 


Gly 
590 


Ser 


Ser 


Glu 


Thr 


Arg Gly 
595 


Glu 


Arg 


Val 


He 

600 


Asp 


Ala 


Val 


Arg 


Phe 
605 


Thr 


Leu 


Arg 


Gin 


Thr 


He Ser 


Phe 


Pro 


Pro Arg 


Asp 


Glu 


Trp 


Asn 


Asp 


Phe 


Asn 


Phe 




610 








615 










620 










Asp 


Met 


Asp Ala 


Arg Arg 


Asn 


Lys 


Gin 


Gin 


Arg 


He 


Lys 


Glu 


Glu 


Gly 


625 








630 










635 










640 



<210> 340 
<211> 448 
<212> PRT 

<213> Homo sapiens 
<400> 340 

Met Ser Gin Ser Thr Gin Thr Asn 

1 5 
Gin His He Trp Asp Phe Leu Glu 
20 

He Asp Leu Asn Phe Val Asp Glu 

35 40 
Lys He Glu He Ser Met Asp Cys 

50 55 
Ser Asp Pro Met Trp Pro Gin Tyr 
65 70 



Glu Phe Leu Ser Pro Glu Val Phe 

10 15 
Gin Pro He Cys Ser Val Gin Pro 
25 30 
Pro Ser Glu Asp Gly Ala Thr Asn 
45 

He Arg Met Gin Asp Ser Asp Leu 
60 

Thr Asn Leu Gly Leu Leu Asn Ser 
75 80 
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Gin 


Gin 


He 


Gin Asn 


Gly 










85 






Thr 


Asp 


His 


Ala 


Gin 


Asn Ser 


Val 








100 








Pro 


Ser 


Ser 


Thr 


Phe Asp Ala 


Leu 






115 








120 


Asn 


Thr 


Asp 


Tvr 


Pro 


Gl Pro 


His 




130 








^ 135 




Ser 




Thr 


Ala 


L s 
ys 


Ser Ala 


Thr 


145 










150 




L 

ys 


Leu 


T 

yr 


c 

ys 


Gin 


He Ala 


_ 

ys 










165 






Met 


Thr 


Pro 
ro 


Pro 




Gin Gly 


Ala 








180 








Lvs 


Lvs 


Ala 


Glu 


His 


Val Thr 


Glu 






195 








200 


Glu 


Leu 


Ser 


Arq 


Glu 


Phe Asn 


Glu 




210 








215 




Leu 


He 


Ara 


Val 


Glu 


Gly Asn 


Ser 


225 










230 




lie 


Thr 


Gly 


Ara 


Gin 


Ser Val 


Leu 










245 






Gly 


Thr 


Glu 


Phe 


Thr 


Thr Val 


Leu 








260 








Cvs 


Val 


Gly 


Gly 


Met 


Asn Arg 


Arg 






275 








280 


Glu 


Thr 


Arg 


Asp 


Gly Gin Val 






290 








295 




lie 


Cys 


Ala 


Cys 


Pro 


Gly Arg 


Asp 


305 










310 




Arg 


Lys 


Gin 


Gin 


Val 


Ser Asp 


Ser 










325 






Ara 


Pro 


Phe 


Arg 


Gin 


Asn Thr 


His 








340 








L 

ys 


Ar 


Arg 




Pro Asp Asp 


Glu 






355 








360 


rg 


Glu 


Thr 


_ 

yr 


Glu 


Met Leu 


Leu 




370 








375 




Met 


Gin 


Tyr 


Leu 


Pro 


Gin His 


Thr 


385 










390 




Gin 


Gin 


Gin 


His 


Gin 


His Leu 


Leu 










405 






Phe 


Arg 


Asn 


Glu 


Leu 


Val Glu 


Pro 








420 








Asp 


Val 


Phe 


Phe 


Arg His Ser 


Lys 






435 








440 



<210> 341 
<211> 356 
<212> PRT 





















90 










95 




Thr 


Ala 


Pro 


Ser 


Pro 


Tyr 


Ala 


Gin 


105 










110 






Ser 


Pro 




Pro 


Ala 


He 


Pro 


Ser 










125 








Ser 


Phe 


Aso 


Val 


Ser 


Phe 


Gin 


Gin 








140 










T 

rp 


Thr 


T 


Ser 


Thr 


Glu 


Leu 


Lvs 






155 










160 


Thr 


Cys 


Pro 


He 


Gin 


He 


Lys 


Val 




170 










175 




Val 




rg 


Ala 


Met 


Pro 




Tvr 

y 


185 










190 






Val 


Val 


Lys 


Arg 


Cys 




Asn 


His 










205 








Gly 


Gin 


He 


Ala 


Pro 


Pro 


Ser 


His 








220 










His 


Ala 


Gin 


Tyr 


Val 


Glu 


Asp 


Pro 






235 










240 


Val 


Pro 


Tyr 


Glu 


Pro 


Pro 


Gin 


Val 




250 










255 




Tyr 




Phe 


Met 


Cvs 


Asn 


Ser 


Ser 


265 










270 






Pro 


He 


Leu 


He 


lie 


Val 


Thr 


Leu 










285 








Gly 


Arg 


Arg 


Cys 


Phe 


Glu 


Ala 


Arg 








300 










Arg 


Lys 


Ala 




Glu 


Asp 


Ser 


He 






315 










320 


Thr 


Lys 




Gly 


Asp 


Gly 


Thr 


Lys 




330 










335 




Gly 


He 


Gin 


Met 


Thr 


Ser 


He 


Lvs 


345 










350 








Leu 


Tvr 


Leu 


Pro 


Val 


Arq 


Glv 










365 










He 




Glu 




Leu 


Glu 


Leu 








380 










He 


Glu 


Thr 


Tyr 


Arg 


Gin 


Gin 
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395 










400 


Gin 
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Leu 


Leu 


Ser 


Ala 


Cys 




410 










415 




Arg 


Arg 


Glu 


Thr 


Pro 


Lys 


Gin 


Ser 


425 










430 






Pro 


Pro 


Asn 


Arg 


Ser 


Val 


Tyr 


Pro 



445 



<213> Homo sapiens 
<400> 341 

Met Leu Tyr Leu Glu Asn Asn Ala Gin Thr Gin Phe Ser Glu Pro Gin 

15 10 15 

Tyr Thr Asn Leu Gly Leu Leu Asn Ser Met Asp Gin Gin He Gin Asn 
20 25 30 
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Gly 


Ser 


Ser 


Ser 


Thr 


Ser 


Pro 


Tyr 


Asn 


Thr 


Asp 


His 


Ala 


Gin 


Asn 


Ser 






35 










40 










45 








Val 


Thr 


Ala 


Pro 


Ser 


Pro 


Tyr Ala Gin 


Pro 


Ser 


Ser 


Thr 


Phe 


Asp 


Ala 




50 










55 
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Leu 


Ser 


Pro 


Ser 


Pro 


Ala 


He 


Pro 


Ser 


Asn 


Thr 
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Pro 


Gly 


Pro 
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Ser 
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Asp 


Val 


Ser 


Phe 


Gin 


Gin 


Ser 


Ser 


Thr 


Ala 


Lys 


Ser 


Ala 
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Thr 
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Thr 


Glu 


Leu 


Lys 


Lys 
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Tyr 


Cys 


Gin 


He 


Ala 
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Pro 
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Gin 
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Lys 


Val 


Met 
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Pro 


Pro 


Pro 


Gin 


Gly 
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120 
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Ala 
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Arg 


Ala 


Met 


Pro 


Val 


Tyr 


Lys 


Lys 


Ala 


Glu 


His 


Val 


Thr 




130 










135 










140 










Glu 


Val 


Val 


Lys 


Arg 


Cys 


Pro 


Asn 
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Glu 


Leu 


Ser 


Arg 


Glu 


Phe 


Asn 


145 










150 
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Glu 


Gly 
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Ala 
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Ser 
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Leu 


He 


Arg 


Val 


Glu 


Gly 


Asn 
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Ala 
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Tyr 


Val 


Glu 


Asp 
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He 


Thr 


Gly 


Arg 


Gin 


Ser 


Val 








180 










185 










190 






Leu 


Val 


Pro 


Tyr 


Glu 


Pro 


Pro 


Gin 


Val 


Gly 


Thr 


Glu 


Phe 


Thr 


Thr 


Val 






195 










200 










205 








Leu 


Tyr 




Phe 


Met 


Cys 


Asn 
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Val 


Gly 
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Met 
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Arg 




210 










215 










220 










Arg 
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lie 


Leu 
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lie 


Val 


Thr 


Leu 


Glu 


Thr 


Arg 
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Gly 


Gin 


Val 


225 










230 
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240 
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Arg 


Arg 


Cys 


Phe 


Glu 


Ala 


Arg 


He 


Cys 


Ala 


Cys 


Pro 


Gly 


Arg 










245 










250 
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Lys 
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Glu 


Asp 


Ser 


He 


Arg 


Lys 


Gin 
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Val 


Ser 


Asp 








260 










265 
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Ser 
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Lys 
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Gly Asp 
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Ser 
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Asn 


Thr 
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Pro 


Asp 


Asp 




290 










295 
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<210> 345 

<211> 1800 

<212> DNA 

<213> Homo sapiens 

<400> 345 

gcgcctcatt gccactgcag tgactaaagc 
actggttgtt ttttaaacaa . attctgatac 
tgacattcgt atcatcactg tgcaccattg 
ggaggtctga aaccctcgca gagggatctt 
agtcgttgga aacaggactc agggataaac 
tttcatcggg ggtgtcaaca aacactccac 



tgggaagacg ctggtcagtt cacctgcccc 60 
aggcgacatc ctcactgacc gagcaaagat 12 0 
gcttctaggc actccagtgg ggtaggagaa 18 0 
gccctcattc tttgggtctg aaacactggc 240 
cagcgcaatg gattggggga cgctgcacac 300 
cagcatcggg aaggtgtgga tcacagtcat 360 
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ctttattttc cgagtcatga tcctagtggt ggctgcccag gaagtgtggg gtgacgagca 42 0 
agaggacttc gtctgcaaca cactgcaacc gggatgcaaa aatgtgtgct atgaccactt 480 
tttcccggtg tcccacatcc ggctgtgggc cctccagctg atcttcgtct ccaccccagc 540 
gctgctggtg gccatgcatg tggcctacta caggcacgaa accactcgca agttcaggcg 600 
aggagagaag aggaatgatt tcaaagacat agaggacatt aaaaagcaca aggttcggat 660 
agaggggtcg ctgtggtgga cgtacaccag cagcatcttt ttccgaatca tctt.tgaagc 720 
agcctttatg tatgtgtttt acttccttta caatgggtac cacctgccct gggtgttgaa 780 
atgtgggatt gacccctgcc ccaaccttgt tgactgcttt atttctaggc caacagagaa 84 0 
gaccgtgttt accattttta tgatttctgc gtctgtgatt tgcatgctgc ttaacgtggc 90 0 
agagttgtgc tacctgctgc tgaaagtgtg ttttaggaga tcaaagagag cacagacgca 960 
aaaaaatcac cccaatcatg ccctaaagga gagtaagcag aatgaaatga atgagctgat 1020 
ttcagatagt ggtcaaaatg caatcacagg tttcccaagc taaacatttc aaggtaaaat 1080 
gtagctgcgt cataaggaga cttctgtctt ctccagaagg caataccaac ctgaaagttc 1140 
cttctgtagc ctgaagagtt tgtaaatgac tttcataata aatagacact tgagttaact 12 00 
ttttgtagga tacttgctcc attcatacac aacgtaatca aatatgtggt ccatctctga 12 60 
aaacaagaga ctgcttgaca aaggagcatt gcagtcactt tgacaggttc cttttaagtg 1320 
gactctctga caaagtgggt actttctgaa aatttatata actgttgttg ataaggaaca 1380 
tttatccagg aattgatacg tttattagga aaagatattt ttataggctt ggatgttttt 1440 
agttccgact ttgaatttat ataaagtatt tttataatga ctggtcttcc ttacctggaa 1500 
aaacatgcga tgttagtttt agaattacac cacaagtatc taaatttcca acttacaaag 1560 
ggtcctatct tgtaaatatt gttttgcatt gtctgttggc aaatttgtga actgtcatga 1620 
tacgcttaag gtgggaaagt gttcattgca caatatattt ttactgcttt ctgaatgtag 1680 
acggaacagt gtggaagcag aaggcttttt taactcatcc gtttggccga tcgttgcaga 1740 
ccactgggag atgtggatgt ggttgcctcc ttttgctcgt ccccgtggct taacccttct 1800 



<210> 346 
<211> 261 
<212> PRT 

<213> Homo sapiens 



<400> 346 



Met 


Asp 


Trp 


Gly 


Thr Leu His 


Thr 


Phe 


He 


Gly 


Gly 


Val 


Asn 


Lys 


His 


1 








5 






10 










15 




Ser 


Thr 


Ser 


He 


Gly Lys Val 


Trp 


He 


Thr 


Val 


He 


Phe 


He 


Phe 


Arg 








20 






25 










30 






Val 


Met 


He 


Leu 


Val Val Ala 


Ala 


Gin 


Glu 


Val 


Trp 


Gly 


Asp 


Glu 


Gin 






35 






40 










45 








Glu 


Asp 


Phe 


Val 


Cys Asn Thr 


Leu 


Gin 


Pro 


Gly 


Cys 


Lys 


Asn 


Val 


Cys 




50 






55 










60 










Tyr 


Asp 


His 


Phe 


Phe Pro Val 


Ser 


His 


He 


Arg 


Leu 


Trp 


Ala 


Leu 


Gin 


65 








70 








75 










80 


Leu 


He 


Phe 


Val 


Ser Thr Pro 


Ala 


Leu 


Leu 


Val 


Ala 


Met 


His 


Val 


Ala 










85 






90 










95 




Tyr 


Tyr 


Arg 


His 


Glu Thr Thr 


Arg 


Lys 


Phe 


Arg 


Arg 


G1 Y 


Glu 


Lys 


Arg 








100 






105 










110 






Asn 


Asp 


Phe 


Lys 


Asp He Glu 


Asp 


He 


Lys 




His 




Val 


Arg 


He 






115 






120 










125 








Glu 


Gly 


Ser 


Leu 


Trp Trp Thr 


Tyr 


Thr 


Ser 


Ser 


He 


Phe 


Phe 


Arg 


He 




130 






135 










140 










He 


Phe 


Glu 


Ala 


Ala Phe Met 


Tyr 


Val 


Phe 


Tyr 


Phe 


Leu 


Tyr 


Asn 


Gly 


145 








150 








155 










160 


Tyr 


His 


Leu 


Pro 


Trp Val Leu 


Lys 


Cys 


Gly 


He 


Asp 


Pro 


Cys 


Pro 


Asn 










165 






170 










175 




Leu 


Val 


Asp 


Cys 


Phe He Ser 


Arg 


Pro 


Thr 


Glu 


Lys 


Thr 


Val 


Phe 


Thr 








180 






185 










190 






He 


Phe 


Met 


He 


Ser Ala Ser 


Val 


He 


Cys 


Met 






Asn 


Val 


Ala 



195 200 205 
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Glu Leu Cys Tyr Leu Leu Leu Lys Val Cys Phe Arg Arg Ser Lys Arg 

210 215 220 

Ala Gin Thr Gin Lys Asn His Pro Asn His Ala Leu Lys Glu Ser Lys 
225 230 235 240 

Gin Asn Glu Met Asn Glu Leu He Ser Asp Ser Gly Gin Asn Ala He 

245 250 255 

Thr Gly Phe Pro Ser 
260 



<210> 347 
<211> 1740 
<212> DNA 
<213> Homo sapiens 

<400> 347 

atgaacaaac tgtatatcgg aaacctcagc 
atcttcaagg acgccaagat cccggtgtcg 
ttcgtggact gcccggacga gagctgggcc 
atagaactgc acgggaaacc catagaagtt 
cggaaacttc agatacgaaa tatcccgcct 
ctagtccagt atggagtggt ggagagctgt 
gttgtaaatg taacctattc cagtaaggac 
ggatttcagt tagagaattt caccttgaaa 
cagcaaaacc ccttgcagca gccccgaggt 
aggcaggggt ctccaggatc cgtatccaag 
ctggttccca cccaatttgt tggagccatc 
atcaccaaac agacccagtc taaaatcgat 
gagaagtcga ttactatcct ctctactcct 
ctggagatta tgcataagga agctcaagat 
attttagctc ataataactt tgttggacgt 
aaaattgagc aagacacaga cactaaaatc 
tataatccag aacgcactat tacagttaaa 
gaggagatca tgaagaaaat cagggagtct 
caagcacatt taattcctgg attaaatctg 
gggatgccac ctcccacctc agggccccct 
gagcaatcag aaacggagac tgttcatctg 
atcggcaagc agggccagca catcaagcag 
attgctccag cggaagcacc agatgctaaa 
gaggctcagt tcaaggctca gggaagaatt 
agtcctaaag aagaggtgaa acttgaagct 
agagttattg gaaaaggagg caaaacggtg 
gttgttgtcc ctcgtgacca gacacctgat 
ggtcacttct atgcttgcca ggttgcccag 
aagcagcacc aacaacagaa ggctctgcaa 



gagaacgccg ccccctcgga cctagaaagt 60 
ggacccttcc tggtgaagac tggctacgcg 12 0 
ctcaaggcca tcgaggcgct ttcaggtaaa 180 
gagcactcgg tcccaaaaag gcaaaggatt 24 0 
catttacagt gggaggtgct ggatagttta 300 
gagcaagtga acactgactc ggaaactgca 360 
caagctagac aagcactaga caaactgaat 420 
gtagcctata tccctgatga aacggccgcc 480 
cgccgggggc ttgggcagag gggctcctca 54 0 
cagaaaccat gtgatttgcc tctgcgcctg 600 
ataggaaaag aaggtgccac cattcggaac 660 
gtccaccgta aagaaaatgc gggggctgct 720 
gaaggcacct ctgcggcttg taagtctatt 780 
ataaaattca cagaagagat ccccttgaag 840 
cttattggta aagaaggaag aaatcttaaa 900 
acgatatctc cattgcagga attgacgctg 960 
ggcaatgttg agacatgtgc caaagctgag 1020 
tatgaaaatg atattgcttc tatgaatctt 1080 
aacgccttgg gtctgttccc acccacttca 1140 
tcagccatga ctcctcccta cccgcagttt 12 00 
tttatcccag ctctatcagt cggtgccatc 12 60 
ctttctcgct ttgctggagc ttcaattaag 1320 
gtgaggatgg tgattatcac tggaccacca 1380 
tatggaaaaa ttaaagaaga aaactttgtt 1440 
catatcagag tgccatcctt tgctgctggc 1500 
aatgaacttc agaatttgtc aagtgcagaa 1560 
gagaatgacc aagtggttgt caaaataact 1620 
agaaaaattc aggaaattct gactcaggta 1680 
agtggaccac ctcagtcaag acggaagtaa 1740 



<210> 348 
<211> 579 
<212> PRT 

<213> Homo sapiens 
<400> 348 

Met Asn Lys Leu Tyr He Gly Asn 

1 5 
Asp Leu Glu Ser He Phe Lys Asp 

20 

Phe Leu Val Lys Thr Gly Tyr Ala 



Leu Ser Glu Asn Ala Ala Pro Ser 

10 15 

Ala Lys He Pro Val Ser Gly Pro 
25 30 

Phe Val Asp Cys Pro Asp Glu Ser 
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35 40 45 



Trp Ala 


Leu 


Lys Ala He 


Glu Ala 


Leu Ser Gly Lys He Glu 


Leu 


His 


50 






55 




60 






Gly Lys 


Pro 


He Glu Val 


Glu His 


Ser Val Pro 


Lys Arg Gin 


Arg 


He 


65 




70 




75 






80 


Arg Lys 


Leu 


Gin He Arg 


Asn He 


Pro Pro His 


Leu Gin Trp 


Glu 


Val 






85 




90 




95 




Leu Asp 


Ser 


Leu Leu Val 


Gin Tyr 


Gly Val Val 


Glu Ser Cys 


Glu 


Gin 






100 




105 


110 






Val Asn 


Thr 


Asp Ser Glu 


Thr Ala 


Val Val Asn 


Val Thr Tyr 


Ser 


Ser 




115 




120 




125 






Lys Asp Gin Ala Arg Gin 


Ala Leu 


Asp Lys Leu Asn Gly Phe Gin 


Leu 


130 






135 




140 






Glu Asn 


Phe 


Thr Leu Lys 


Val Ala 


Tyr He Pro 


Asp Glu Thr 


Ala 


Ala 


145 




150 




155 




160 


Gin Gin 


Asn 


Pro Leu Gin 


Gin Pro 


Arg Gly Arg Arg Gly Leu Gly Gin 






165 




170 




175 




Arg Gly 


Ser 


Ser Arg Gin 


Gly Ser 


Pro Gly Ser 


Val Ser Lys 


Gin 


Lys 






180 




185 


190 






Pro Cys 


Asp 


Leu Pro Leu 


Arg Leu 


Leu Val Pro 


Thr Gin Phe 


Val 


Gly 




195 




200 




205 




Ala lie 


lie Gly Lys Glu 


Gly Ala 


Thr He Arg Asn He Thr Lys Gin 


210 






215 




220 






Thr Gin 


Ser 


Lys He Asp 


Val His 


Arg Lys Glu Asn Ala Gly Ala Ala 


225 




230 




235 






240 


Glu Lys 


Ser 


lie Thr He 


Leu Ser 


Thr Pro Glu Gly Thr Ser Ala Ala 






245 




250 




255 




Cys Lys 


Ser 


He Leu Glu 


He Met 


His Lys Glu 


Ala Gin Asp 


He 








260 




265 


270 






Phe Thr 


Glu 


Glu He Pro 


Leu Lys 


He Leu Ala 


His Asn Asn 


Phe 


Val 




275 




280 




285 






Gly Arg Leu 


He Gly Lys 


Glu Gly 


Arg Asn Leu 


Lys Lys He 


Glu 


Gin 


290 






295 


1 


300 






Asp Thr 


Asp 


Thr Lys He 


Thr He 


Ser Pro Leu 


Gin Glu Leu 


Thr 


Leu 


305 




310 




315 






320 


Tyr Asn 


Pro 


Glu Arg Thr 


He Thr 


Val Lys Gly Asn Val Glu Thr Cys 






325 




330 




335 




Ala Lys 


Ala 


Glu Glu Glu 


He Met 


Lys Lys He 


Arg Glu Ser 


Tyr 


Glu 






340 




345 


350 






Asn Asp 


He 


Ala Ser Met 


Asn Leu 


Gin Ala His 


Leu He Pro 


Gly 


Leu 




355 




360 




365 






Asn Leu 


Asn 


Ala Leu Gly 


Leu Phe 


Pro Pro Thr 


Ser Gly Met 


Pro 


Pro 


370 






375 




380 






Pro Thr 


Ser 


Gly Pro Pro 


Ser Ala 


Met Thr Pro 


Pro Tyr Pro 


Gin 


Phe 


385 




390 




395 




400 


Glu Gin 


Ser 


Glu Thr Glu 


Thr Val 


His Leu Phe 


He Pro Ala 


Leu 


Ser 






405 




410 




415 




Val Gly Ala 


He He Gly 


Lys Gin 


Gly Gin His 


lie Lys Gin 


Leu 


Ser 






'42 0 




425 


430 






Arg Phe Ala Gly Ala Ser 


He Lys 


He Ala Pro 


Ala Glu Ala 


Pro 


Asp 




435 




440 




445 






Ala Lys Val Arg Met Val 


He He 


Thr Gly Pro 


Pro Glu Ala 


Gin 


Phe 


450 






455 




460 






Lys Ala 


Gin 


Gly Arg He 


Tyr Gly 


Lys He Lys 


Glu Glu Asn 


Phe 


Val 


465 




470 




475 






480 


Ser Pro 


Lys 


Glu Glu Val 


Lys Leu 


Glu Ala His 


He Arg Val 


Pro 


Ser 






485 




490 




4 95 




Phe Ala Ala Gly Arg Val 


He Gly 


Lys Gly Gly Lys Thr Val 


Asn 


Glu 
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500 








505 








510 






Leu Gin Asn 


Leu 


Ser 


Ser 


Ala Glu 


Val 


Val 


Val 


Pro Arg 


Asp 


Gin 


Thr 


515 








520 








525 








Pro Asp Glu 


Asn Asp 


Gin 


Val Val 


Val 


Lys 


He 


Thr Gly His 


Phe 


Tyr 


530 








535 








540 








Ala Cys Gin 


Val 


Ala 


Gin Arg Lys 


He 


Gin 


Glu 


He Leu 


Thr 


Gin 


Val 


545 






550 








555 








560 


Lys Gin His 


Gin 


Gin 
565 


Gin 


Lys Ala 


Leu 


Gin 
570 


Ser 


Gly Pro 


Pro 


Gin 
575 


Ser 


Arg Arg Lys 

























<210> 349 
<211> 207 
<212> DNA 

<213> Homo sapiens 
<400> 349 

atgtggcagc ccctcttctt caagtggctc ttgtcctgtt gccctgggag ttctcaaatt 60 
gctgcagcag cctccaccca gcctgaggat gacatcaata cacagaggaa gaagagtcag 120 
gaaaagatga gagaagttac agactctcct gggcgacccc gagagcttac cattcctcag 180 
acttcttcac atggtgctaa cagattt 2 07 

<210> 350 
<211> 69 
<212> PRT 

<213> Homo sapiens 
<400> 350 

Met Trp Gin Pro Leu Phe Phe Lys Trp Leu Leu Ser Cys Cys Pro Gly 

15 10 15 

Ser Ser Gin He Ala Ala Ala Ala Ser Thr Gin Pro Glu Asp Asp He 

20 25 30 

Asn Thr Gin Arg Lys Lys Ser Gin Glu Lys Met Arg Glu Val Thr Asp 

35 40 45 

Ser Pro Gly Arg Pro Arg Glu Leu Thr He Pro Gin Thr Ser Ser His 

50 55 60 

Gly Ala Asn Arg Phe 
65 



<210> 351 
<211> 1012 
<212> DNA 

<213> Homo sapiens 
<400> 351 

ccctctagaa ataattttgt 
catcacacgg ccgcgtccga 
ccgatcgggc aggcgatggc 
cctaccgcct tcctcggctt 
cgcgtggtcg .ggagcgctcc 
gcggtcgacg gcgctccgat 
catcccggtg acgtcatctc 
aacgtgacat tggccgaggg 
ttcatcgggg gtgtcaacaa 
tttattttcc gagtcatgat 



ttaactttaa gaaggagata 
taacttccag ctgtcccagg 
gatcgcgggc cagatcaagc 
gggtgttgtc gacaacaacg 
ggcggcaagt ctcggcatct 
caactcggcc accgcgatgg 
ggtgacctgg caaaccaagt 
acccccggcc gaattcatgg 
acactccacc agcatcggga 
cctcgtggtg gctgcccagg 



tacatatgca tcaccatcac 60 
gtgggcaggg attcgccatt 12 0 
ttcccaccgt tcatatcggg 180 
gcaacggcgc acgagtccaa 240 
ccaccggcga cgtgatcacc 300 
cggacgcgct taacgggcat 360 
cgggcggcac gcgtacaggg 42 0 
attgggggac gctgcacact 480 
aggtgtggat cacagtcatc 54 0 
aagtgtgggg tgacgagcaa 600 
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gaggacttcg tctgcaacac actgcaaccg ggatgcaaaa atgtgtgcta tgaccacttt 660 
ttcccggtgt cccacatccg gctgtgggcc ctccagctga tcttcgtctc caccccagcg 72 0 
ctgctggtgg ccatgcatgt ggcctactac aggcacgaaa ccactcgcaa gttcaggcga 78 0 
ggagagaaga ggaatgattt caaagacata gaggacatta aaaagcagaa ggttcggata 84 0 
gaggggtgac tcgagcacca ccaccaccac cactgagatc cggctgctaa caaagcccga 900 
aaggaagctg agttggctgc tgccaccgct gagcaataac tagcataacc ccttggggcc 960 
tctaaacggg tcttgagggg ttttttgctg aaaggaggaa ctatatccgg at 1012 

<210> 352 
<211> 267 
<212> PRT 

<213> Homo sapiens 
<400> 352 



Met 


His 


His 


His 


His 


His His Thr Ala 


Ala 


Ser 


Asp 


Asn Phe Gin Leu 


1 








5 




10 






15 


Ser 


Gin 


Gly 


Gly 


Gin 


Gly Phe Ala He 


Pro 


He 


Gly 


Gin Ala Met Ala 








20 




25 








30 


lie 


Ala 


Gly 


Gin 


He 


Lys Leu Pro Thr 


Val 


His 


He 


Gly Pro Thr Ala 






35 






40 








45 


Phe 


Leu 


Gly 


Leu 


Gly 


Val Val Asp Asn 


Asn 


Gly 


Asn 


Gly Ala Arg Val 




50 








55 






60 




Gin 


Arg 


Val 


Val 


Gly 


Ser Ala Pro Ala 


Ala 


Ser 


Leu 


Gly He Ser Thr 


65 










70 




75 




80 


Gly 


Asp 


Val 


He 


Thr 


Ala Val Asp Gly 


Ala 


Pro 


He 


Asn Ser Ala Thr 










85 




90 






95 


Ala 


Met 


Ala 


Asp 


Ala 


Leu Asn Gly His 


His 


Pro 


Gly 


Asp Val He Ser 








100 




105 








110 


Val 


Thr 


Trp 


Gin 


Thr 


Lys Ser Gly Gly 


Thr 


Arg 


Thr 


Gly Asn Val Thr 






115 






120 








125 


Leu 


Ala 


Glu 


Gly 


Pro 


Pro Ala Glu Phe 


Met 


Asp 


Trp 


Gly Thr Leu His 




130 








135 






140 




Thr 


Phe 


He 


Gly 


Gly 


Val Asn Lys His 


Ser 


Thr 


Ser 


He Gly Lys Val 


145 










150 




155 




160 


Trp 


He 


Thr 


Val 


He 


Phe He Phe Arg 


Val 


Met 


He 


Leu Val Val Ala 










165 




170 






175 


Ala 


Gin 


Glu 


Val 


Trp 


Gly Asp Glu Gin 


Glu 


Asp 


Phe 


Val Cys Asn Thr 








180 




185 








190 


Leu 


Gin 


Pro 


Gly 


Cys 


Lys Asn Val Cys 


Tyr 


Asp 


His 


Phe Phe Pro Val 






195 






200 








205 


Ser 


His 


He 


Arg 


Leu 


Trp Ala Leu Gin 


Leu 


He 


Phe 


Val Ser Thr Pro 




210 








215 






220 




Ala 


Leu 


Leu 


Val 


Ala 


Met His Val Ala 


Tyr 


Tyr 


Arg 


His Glu Thr Thr 


225 










230 




235 




240 


Arg 


Lys 


Phe 


Arg 


Arg 


Gly Glu Lys Arg 


Asn 


Asp 


Phe 


Lys Asp He Glu 










245 




250 






255 


Asp 


He 


Lys 


Lys 


Gin 


Lys Val Arg He 


Glu 


Gly 












260 




265 











<210> 353 
<211> 900 
<212> DNA 

<213> Homo sapiens 
<400> 353 

atgcatcacc atcaccatca cacggccgcg tccgataact tccagctgtc ccagggtggg 60 
cagggattcg ccattccgat cgggcaggcg atggcgatcg cgggccagat caagcttccc 12 0 
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accgttcata tcgggcctac cgccttcctc ggcttgggtg ttgtcgacaa caacggcaac 18 0 
ggcgcacgag tccaacgcgt ggtcgggagc gctccggcgg caagtctcgg catctccacc 240 
ggcgacgtga tcaccgcggt cgacggcgct ccgatcaact cggccaccgc gatggcggac 300 
gcgcttaacg ggcatcatcc cggtgacgtc atctcggtga cctggcaaac caagtcgggc 360 
ggcacgcgta cagggaacgt gacattggcc gagggacccc cggccgaatt ccacgaaacc 42 0 
actcgcaagt tcaggcgagg agagaagagg aatgatttca aagacataga ggacattaaa 480 
aagcagaagg ttcggataga ggggtcgctg tggtggacgt acaccagcag catctttttc 54 0 
cgaatcatct ttgaagcagc ctttatgtat gtgttttact tcctttacaa tgggtaccac 600 
ctgccctggg tgttgaaatg tgggattgac ccctgcccca accttgttga ctgctttatt 660 
tctaggccaa cagagaagac cgtgtttacc atttttatga tttctgcgtc tgtgatttgc 720 
atgctgctta acgtggcaga gttgtgctac ctgctgctga aagtgtgttt taggagatca 780 
aagagagcac agacgcaaaa aaatcacccc aatcatgccc taaaggagag taagcagaat 840 
gaaatgaatg agctgatttc agatagtggt caaaatgcaa tcacaggttt cccaagctaa 900 



<210> 354 
<211> 299 
<212> PRT 

<213> Homo sapiens 



<400> 354 



Met 


His 


His 


His 


His 


His 


His 


Thr 


Ala 


Ala 


Ser 


Asp 


Asn 


Phe 


Gin 


Leu 


1 








5 










10 










15 




Ser 


Gin 


Gly 


Gly 


Gin 


Gly 


Phe 


Ala 


He 


Pro 


He 


Gly 


Gin 


Ala 


Met 


Ala 








20 










25 










30 






He 


Ala 


Gly 


Gin 


He 


Lys 


Leu 


Pro 


Thr 


Val 


His 


He 


Gly 


Pro 


Thr 


Ala 






35 










40 










45 








Phe 


Leu 


Gly 


Leu 


Gly 


Val 


Val 


Asp 


Asn 


Asn 


Gly 


Asn 


Gly 


Ala 


Arg 


Val 




50 










55 










60 










Gin 


Arg 


Val 


Val 


Gly 


Ser 


Ala 


Pro 


Ala 


Ala 


Ser 


Leu 


Gly 


He 


Ser 


Thr 


65 










70 










75 










80 


Gly 


Asp 


Val 


He 


Thr 


Ala 


Val 


Asp 


Gly 


Ala 


Pro 


He 


Asn 


Ser 


Ala 


Thr 










85 










90 










95 




Ala 


Met 


Ala 


Asp 


Ala 


Leu 


Asn 


Gly 


His 


His 


Pro 


Gly 


Asp 


Val 


He 


Ser 








100 










105 










110 






Val 


Thr 


Trp 


Gin 


Thr 


Lys 


Ser 


Gly 


Gly 


Thr 


Arg 


Thr 


Gly 


Asn 


Val 


Thr 






115 










120 










125 








Leu 


Ala 


Glu 


Gly 


Pro 


Pro 


Ala 


Glu 


Phe 


His 


Glu 


Thr 


Thr 


Arg 


Lys 


Phe 




130 










135 










140 










Arg 


Arg 


Gly 


Glu 


Lys 


Arg 


Asn 


Asp 


Phe 


Lys 


Asp 


He 


Glu 


Asp 


He 


Lys 


145 










150 










155 










160 


Lys 


Gin 


Lys 


Val 


Arg 


He 


Glu 


Gly 


Ser 


Leu 


Trp 


Trp 


Thr 


Tyr 


Thr 


Ser 










165 










170 










175 




Ser 


He 


Phe 


Phe 


Arg 


He 


He 


Phe 


Glu 


Ala 


Ala 


Phe 


Met 


Tyr 


Val 


Phe 








180 










185 










190 






Tyr 


Phe 


Leu 


Tyr 


Asn 


Gly 


Tyr 


His 


Leu 


Pro 


Trp 


Val 


Leu 


Lys 


Cys 


Gly 






195 










200 










205 








He 


Asp 


Pro 


Cys 


Pro 


Asn 


Leu 


Val 


Asp 


Cys 


Phe 


He 


Ser 


Arg 


Pro 


Thr 




210 










215 










220 










Glu 


Lys 


Thr 


Val 


Phe 


Thr 


He 


Phe 


Met 


He 


Ser 


Ala 


Ser 


Val 


lie 


Cys 


225 










230 










235 










240 


Met 


Leu 


Leu 


Asn 


Val 


Ala 


Glu 


Leu 


Cys 


Tyr 


Leu 


Leu 


Leu 


Lys 


val 


Cys 










245 










250 










255 




Phe 


Arg 


Arg 


Ser 


Lys 


Arg 


Ala 


Gin 


Thr 


Gin 


Lys 


Asn 


His 


Pro 


Asn 


His 








260 










265 










270 






Ala 


Leu 


Lys 


Glu 


Ser 


Lys 


Gin 


Asn 


Glu 


Met 


Asn 


Glu 


Leu 


He 


Ser 


Asp 






275 










280 










285 








Ser 


Gly 


Gin 


Asn 


Ala 


He 


Thr 


Gly 


Phe 


Pro 


Ser 
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290 295 



<210> 355 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 355 

ggagtacagc ttcaagacaa tggg 24 

<210> 356 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 356 

ccatgggaat tcattataat aattttgttc c 31 

<210> 357 
<211> 920 
<212> PRT 

<213> Homo sapiens 



<400> 357 



Met 


Gin 


His 


His 


His 


His 


His 


His 


Gly Val Gin Leu Gin 


Asp 


Asn 


Gly 


1 








5 








10 




15 




Tyr 


Asn 


Gly 


Leu 


Leu 


He 


Ala 


He 


Asn Pro Gin Val Pro 


Glu 


Asn 


Gin 








20 










25 


30 






Asn 


Leu 


He 


Ser 


Asn 


He 


Lys 


Glu 


Met He Thr Glu Ala 


Ser 


Phe 


Tyr 






35 










40 


45 










Phe 


Asn 


Ala 


Thr 




Arg 


Arg 


Val Phe Phe Arg Asn 


He 


Lys 


He 




50 










55 




60 








Leu 


He 


Pro 


Ala 


Thr 


Trp 


Lys 


Ala 


Asn Asn Asn Ser Lys 


He 


Lys 


Gin 


65 










70 






75 






80 


Glu 


Ser 


Tyr 


Glu 


Lys 


Ala 


Asn 


Val 


He Val Thr Asp Trp Tyr Gly Ala 










85 








90 




95 




His 


Gly 


Asp 


Asp 


Pro 


Tyr 


Thr 


Leu 


Gin Tyr Arg Gly Cys 


Gly Lys 


Glu 








100 










105 


110 






Gly 


Lys 


Tyr 


He 


His 


Phe 


Thr 


Pro 


Asn Phe Leu. Leu Asn Asp Asn Leu 






115 










120 


125 








Thr 


Ala 


Gly 


Tyr 


Gly 


Ser 


Arg 


Gly 


Arg Val Phe Val His 


Glu 


Trp 


Ala 




130 










135 




140 








His 


Leu 


Arg 


Trp 


Gly 


Val 


Phe 


Asp 


Glu Tyr Asn Asn Asp 


Lys 


Pro 


Phe 


145 










150 






155 






160 


Tyr 


He 


Asn 


Gly 


Gin 


Asn 


Gin 


He 


Lys Val Thr Arg Cys 


Ser 


Ser 


Asp 










165 








170 




175 




lie 


Thr 


Gly 


He 


Phe 


Val 


Cys 


Glu 


Lys Gly Pro Cys Pro 


Gin 


Glu 


Asn 








180 










185 


190 






Cys 


He 


He 


Ser 


Lys 


Leu 


Phe 


Lys 


Glu Gly Cys Thr Phe 


He 


Tyr 


Asn 






195 










200 


205 








Ser 


Thr 


Gin 


Asn 


Ala 


Thr 


Ala 


Ser 


He Met Phe Met Gin 


Ser 


Leu 


Ser 
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210 










215 




Ser 


Val 


Val 


Glu 


Phe 


Cys 




Ala 


225 










230 






Asn 


Leu 


Gin 


Asn 


Gin 


Met 


Cys 


Ser 










245 








Thr 


Asp 


Ser 


Ala Asp 


Phe 


His 


His 








260 










Leu 


Pro 


Pro 


Pro 


Pro 


Thr 


Phe 


Ser 






275 










280 


Val 


Cys 


Leu 


Val 




Asp Val 


Ser 




290 










295 




Leu 


Leu 


Gin 


Leu 


Gin 


Gin 


Ala 


Ala 


305 










310 






Glu 


He 


His 


Thr 


Phe Val Gly 


He 










325 








He 


Arg 


Ala 


Gin 


Leu 


His 


Gin 


He 








340 










Leu 


Val 


Ser 


Tyr 


Leu 


Pro 


Thr 


Thr 






355 










360 


He 


Cys 


Ser 


Gly Leu Lys 


Lys 


Gly 




370 










375 




Gly 


Lys 


Ala 


Tyr Gly Ser Val 


Met 


385 










390 






Lys 


Leu 


Leu 


Gly Asn Cys 


Leu 


Pro 










405 








He 


His 


Ser 


He Ala Leu Gly 


Ser 








420 










Leu 


Ser 


Arg 




Thr 


Gly Gly 


Leu 






435 










440 


Asn 


Ser 


Asn 


Ser 


Met 


He 


Asp 


Ala 




450 










455 




Gly 


Asp 


He 


Phe 


Gin 


Gin 


His 


He 


465 










470 






Val 


Lys 


Pro 


His 


His 


Gin 


Leu 


Lys 










485 








Val 


Gly 


Asn 


Asp 


Thr 


Met 


Phe 


Leu 








500 










Pro 


Glu 


He 


He 




Phe 


Asp 


Pro 






515 










520 


Asn 


Phe 


He 


Thr 


Asn 


Leu 


Thr 


Phe 




530 










535 




Gly 


Thr 


Ala 


Lys 


Pro 


Gly His 


Trp 


545 










550 






His 


Ser 


Leu 


Gin 


Ala 


Leu 


Lys 


Val 










565 








Ser 


Ala 


Val 


Pro 


Pro 


Ala 


Thr 


Val 








580 










Leu 


His 




Pro 


His 


Pro 


Val 


Met 






595 










600 


Phe 


Tyr 


Pro 


He 


Leu 


Asn 


Ala 


Thr 




610 










615 




Thr 


Gly 


Asp 


Pro 


Val 


Thr 


Leu 


Arg 


625 










630 






Asp 


Val 


He 




Asn 


Asp 


Gly 


He 


Ala 


Ala 


Asn 


Gly Arg Tyr 


Ser 










660 










Ser 


He 


Ser 


Thr 


Pro 


Ala 


His 


Ser 



220 



Ser 


Thr 


His 


Asn 


Gin 


Glu 


Ala 


Pro 






235 










240 


Leu 


Arg 


Ser 


Ala 


Trp 


Asp 


Val 


He 




250 










255 




Ser 


Phe 


Pro 


Met 


Asn 


Gly 


Thr 


Glu 


265 










270 






Leu 


Val 


Glu 


Ala 


Gly 


Asp 


Lys 


Val 










285 








Ser 


Lys 


Met 


Ala 


Glu 


Ala 


Asp 


Arg 








300 










Glu 


Phe 


Tyr 


Leu 


Met 


Gin 


He 


Val 






315 










320 


Ala 


Ser 


Phe 


Asp 


Ser 


Lys 


Gly 


Glu 




330 










335 




Asn 


Ser 


Asn 


Asp 


Asp 


Arg 


Lys 


Leu 


345 










350 






Val 


Ser 


Ala 


Lys 


Thr 


Asp 


lie 


Ser 










365 








Phe 


Glu 


Val 


Val 


Glu 


Lys 


Leu 


Asn 








380 










He 


Leu 


Val 


Thr 


Ser 


Gly 


Asp 


Asp 






395 










400 


Thr 


Val 




Ser 


Ser 


Gly 


Ser 


Thr 




410 










415 




Ser 


Ala 


Ala 


Pro 


Asn 


Leu 


Glu 


Glu 


425 










430 






Lys 


Phe 


Phe 


Val 


Pro 


Asp 


He 


Ser 










445 








Phe 


Ser 


Arg 


He 


Ser 


Ser 


Gly 


Thr 








460 










Gin 


Leu 


Glu 


Ser 


Thr 


Gly 


Glu 


Asn 






475 










480 


Asn 


Thr 


Val 


Thr 


Val 


Asp 


Asn 


Thr 




490 










495 




Val 


Thr 


Trp 


Gin 


Ala 


Ser 


Gly 


Pro 


505 










510 






Asp 


Gly 


Arg 


Lys 


Tyr 


Tyr 


Thr 


Asn 










525 








Arg 


Thr 


Ala 


Ser 


Leu 


Trp 


He 


Pro 








540 










Thr 


Tyr 


Thr 


Leu 


Asn 


Asn 


Thr 


His 






555 










560 


Thr 


Val 


Thr 


Ser 


Arg 


Ala 


Ser 


Asn 




570 










575 




Glu 


Ala 


Phe 


Val 


Glu 


Arg 


Asp 


Ser 


585 










590 






He 


Tyr 


Ala 


Asn 


Val 


Lys 


Gin 


Gly 










605 








Val 


Thr 


Ala 


Thr 


Val 


Glu 


Pro 


Glu 








620 










Leu 


Leu 


Asp 


Asp 


Gly 


Ala 


Gly 


Ala 






635 










640 


Tyr 


Ser 




Tyr 


Phe 


Phe 


Ser 


Phe 




650 










655 




Lys 


Val 


His 


Val 


Asn 


His 


Ser 


Pro 


665 










670 






He 


Pro 


Gly 


Ser 


His 


Ala 


Met 


Tyr 
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675 










680 










685 








Val 


Pro 


Gly 


Tyr 


Thr 


Ala Asn Gly 


Asn 


He 


Gin 


Met 




Ala 


Pro 


Arg 




690 










695 










700 












Ser 


Val 


Gly 






Glu 


Glu 


Glu 


Arg 


Lys 


Trp 


Gly 


Phe 


Ser 


Arg 


705 










710 










715 










720 


Val 


Ser 


Ser 


Gly 


Gly 
725 


Ser 


Phe 


Ser 


Val 


730 


Gly 


Val 


Pro 


Ala 


Gly 
735 


Pro 


His 


Pro 


Asp 


Val 

740 


Phe 


Pro 


Pro 


Cys 


Lys 
745 


He 


He 


Asp 


Leu 


Glu 
750 


Ala 


Val 




Val 


Glu 
755 


Glu 


Glu 


Leu 


Thr 


Leu 
760 


Ser 


Trp 


Thr 


Ala 


Pro 
765 


Gly 


Glu 


Asp 


Phe 


770 


Gin 


Gly 


Gin 


Ala 


Thr 
775 


Ser 




Glu 


He 


780 


Met 


Ser 




Ser 




Gin 




He 


Gin 


Asp Asp Phe 






Ala 


He 




Val 




Thr 


785 










790 










795 










800 


Ser 








Pro 
805 


Gin 


Gin 


Ala 


Gly 


He 
810 




Glu 


He 


Phe 


Thr 
815 


Phe 


Ser 


Pro 


Gin 


He 
820 


Ser 


Thr 


Asn 


Gly 


Pro 
825 


Glu 


His 


Gin 


Pro 


Asn 
830 


Gly 


Glu 


Thr 


His 


Glu 

835 


Ser 


His 


Arg 


He 


Tyr 
840 


Val 


Ala 


He 


Arg 


Ala 
845 


Met 


Asp 


Arg 


Asn 


Ser 
850 


Leu 


Gin 


Ser 


Ala 


Val 
855 


Ser 


Asn 


He 


Ala 


Gin 

860 


Ala 


Pro 


Leu 


Phe 


He 


Pro 


Pro 


Asn 


Ser 


Asp 


Pro 


Val 


Pro 


Ala 


Arg 


Asp 


Tyr 


Leu 


He 


Leu 


865 










870 










875 










880 


Lys 


Gly 


Val 


Leu 


Thr 
885 


Ala 


Met 


Gly 


Leu 


He 
890 


Gly 


He 


He 


Cys 


Leu 
895 


He 


He 


Val 


Val 


Thr 
900 


His 


His 


Thr 


Leu 


Ser 
905 


Arg 


Lys 


Lys 


Arg 


Ala 
910 


Asp 


Lys 


Lys 


Glu 


Asn 
915 


Gly 


Thr 


Lys 


Leu 


Leu 
920 



















<210> 358 

<211> 2773 

<212> DNA 

<213> Homo sapiens 

<400> 358 

catatgcagc atcaccacca tcaccacgga gtacagcttc aagacaatgg gtataatgga 60 
ttgctcattg caattaatcc tcaggtacct gagaatcaga acctcatctc aaacattaag 120 
gaaatgataa ctgaagcttc attttaccta tttaatgcta ccaagagaag agtatttttc 180 
agaaatataa agattttaat acctgccaca tggaaagcta ataataacag caaaataaaa 240 
caagaatcat atgaaaaggc aaatgtcata gtgactgact ggtatggggc acatggagat 300 
gatccataca ccctacaata cagagggtgt ggaaaagagg gaaaatacat tcatttcaca 360 
cctaatttcc tactgaatga taacttaaca gctggctacg gatcacgagg ccgagtgttt 420 
gtccatgaat gggcccacct ccgttggggt gtgttcgatg agtataacaa tgacaaacct 48 0 
ttctacataa atgggcaaaa tcaaattaaa gtgacaaggt gttcatctga catcacaggc 540 
atttttgtgt gtgaaaaagg tccttgcccc caagaaaact gtattattag taagcttttt 600 
aaagaaggat gcacctttat ctacaatagc acccaaaatg caactgcatc aataatgttc 660 
atgcaaagtt tatcttctgt ggttgaattt tgtaatgcaa gtacccacaa ccaagaagca 72 0 
ccaaacctac agaaccagat gtgcagcctc agaagtgcat gggatgtaat cacagactct 780 
gctgactttc accacagctt tcccatgaac gggactgagc ttccacctcc tcccacattc 840 
tcgcttgtag aggctggtga caaagtggtc tgtttagtgc tggatgtgtc cagcaagatg 900 
gcagaggctg acagactcct tcaactacaa caagccgcag aattttattt gatgcagatt 960 
gttgaaattc ataccttcgt gggcattgcc agtttcgaca gcaaaggaga gatcagagcc 1020 
cagctacacc aaattaacag caatgatgat cgaaagttgc tggtttcata tctgcccacc 108 0 
actgtatcag ctaaaacaga catcagcatt tgttcagggc ttaagaaagg atttgaggtg 114 0 
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gttgaaaaac tgaatggaaa agcttatggc tctgtgatga tattagtgac cagcggagat 1200 
gataagcttc ttggcaattg cttacccact gtgctcagca gtggttcaac aattcactcc 12 60 
attgccctgg gttcatctgc agccccaaat ctggaggaat tatcacgtct tacaggaggt 1320 
ttaaagttct ttgttccaga tatatcaaac tccaatagca tgattgatgc tttcagtaga 1380 
atttcctctg gaactggaga cattttccag caacatattc agcttgaaag tacaggtgaa 1440 
aatgtcaaac ctcaccatca attgaaaaac acagtgactg tggataatac tgtgggcaac 1500 
gacactatgt ttctagttac gtggcaggcc agtggtcctc ctgagattat attatttgat 1560 
cctgatggac gaaaatacta cacaaataat tttatcacca atctaacttt tcggacagct 1620 
agtctttgga ttccaggaac agctaagcct gggcactgga cttacaccct gaacaatacc 168 0 
catcattctc tgcaagccct gaaagtgaca gtgacctctc gcgcctccaa ctcagctgtg 1740 
cccccagcca ctgtggaagc ctttgtggaa agagacagcc tccattttcc tcatcctgtg 1800 
atgatttatg ccaatgtgaa acagggattt tatcccattc ttaatgccac tgtcactgcc 18 60 
acagttgagc cagagactgg agatcctgtt acgctgagac tccttgatga tggagcaggt 1920 
gctgatgtta taaaaaatga tggaatttac tcgaggtatt ttttctcctt tgctgcaaat 1980 
ggtagatata gcttgaaagt gcatgtcaat cactctccca gcataagcac cccagcccac 2040 
tctattccag ggagtcatgc tatgtatgta ccaggttaca cagcaaacgg taatattcag 2100 
atgaatgctc caaggaaatc agtaggcaga aatgaggagg agcgaaagtg gggctttagc 2160 
cgagtcagct caggaggctc cttttcagtg ctgggagttc cagctggccc ccaccctgat 2220 
gtgtttccac catgcaaaat tattgacctg gaagctgtaa aagtagaaga ggaattgacc 2280 
ctatcttgga cagcacctgg agaagacttt gatcagggcc aggctacaag ctatgaaata 2340 
agaatgagta aaagtctaca gaatatccaa gatgacttta acaatgctat tttagtaaat 2400 
acatcaaagc gaaatcctca gcaagctggc atcagggaga tatttacgtt ctcaccccaa 24 60 
atttccacga atggacctga acatcagcca aatggagaaa cacatgaaag ccacagaatt 252 0 
tatgttgcaa tacgagcaat ggataggaac tccttacagt ctgctgtatc taacattgcc 2580 
caggcgcctc tgtttattcc ccccaattct gatcctgtac ctgccagaga ttatcttata 2640 
ttgaaaggag ttttaacagc aatgggtttg ataggaatca tttgccttat tatagttgtg 27 00 
acacatcata ctttaagcag gaaaaagaga gcagacaaga aagagaatgg aacaaaatta 27 60 
ttataatgaa ttc 2773 

<210> 359 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 359 

tggcagcccc tcttcttcaa gtggc 25 

<210> 360 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 360 

cgccagaatt catcaaacaa atctgttagc acc 33 

<210> 361 
<211> 77 
<212> PRT 

<213> Homo sapiens 
<400> 361 

Met Gin His His His His His His Trp Gin Pro Leu Phe Phe Lys Trp 
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1 5 
Leu Leu Ser Cys Cys Pro Gly Ser 

20 

Thr Gin Pro Glu Asp Asp lie Asn 

35 40 
Lys Met Arg Glu Val Thr Asp Ser 

50 55 
He Pro Gin Thr Ser Ser His Gly 
65 70 



10 15 
Ser Gin He Ala Ala Ala Ala Ser 
25 30 
Thr Gin Arg Lys Lys Ser Gin Glu 
45 

Pro Gly Arg Pro Arg Glu Leu Thr 
60 

Ala Asn Arg Phe Val 
75 



<210> 362 
<211> 244 
<212> DNA 
<213> Homo sapiens 

<400> 362 

catatgcagc atcaccacca tcaccactgg cagcccctct tcttcaagtg gctcttgtcc 60 
tgttgccctg ggagttctca aattgctgca gcagcctcca cccagcctga ggatgacatc 12 0 
aatacacaga ggaagaagag tcaggaaaag atgagagaag ttacagactc tcctgggcga 180 
ccccgagagc ttaccattcc tcagacttct tcacatggtg ctaacagatt tgtttgatga 240 
attc • 244 

<210> 363 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 363 

Met Trp Gin Pro Leu Phe Phe Lys Trp Leu Leu Ser Cys Cys Pro Gly 

15 10 15 

Ser Ser Gin He 
20 



<210> 364 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<400> 364 

atgtggcagc ccctcttctt caagtggctc ttgtcctgtt gccctgggag ttctcaaatt 60 



<210> 365 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 365 

Gly Ser Ser Gin He Ala Ala Ala Ala Ser Thr Gin Pro Glu Asp Asp 

15 10 15 

He Asn Thr Gin 
20 



<210> 366 
<211> 60 
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<212> DNA 

<213> Homo sapiens 

<400> 366 

gggagttctc aaattgctgc agcagcctcc acccagcctg aggatgacat caatacacag 60 



<210> 367 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 367 

Lys Pro Gly His Trp Thr Tyr Thr Leu Asn Asn Thr His His Ser Leu 

15 10 15 

Gin Ala Leu Lys 
20 



■ <210> 368 
<211> 2343 
<212> DNA 
<213> Homo sapiens 

<400> 368 

attccggagc gtttgcggct tcgcttcatg gccgctctcc cgcccctcct gggatctgtg 60 

gggagctggg gagcccgcag cggcccggag ccggagctgg cgagccgagc ggagacctgt 12 0 

gcgccgcgcc tctgaggcgc agcatgtgaa gcggagacgg catccagtgg ggggcgagcc 180 

tctcagccgg ccgggatggc taccacggcc gagctcttcg aggagccttt tgtggcagat 240 

gaatatattg aacgtcttgt atggagaacc ccaggaggag gctctagagg tggacctgaa 300 

gcttttgatc ctaaaagatt attagaagaa tttgtaaatc atattcagga actccagata 360 

atggatgaaa ggattcagag gaaagtagag aaactagagc aacaatgtca gaaagaagcc 42 0 

aaggaatttg ccaagaaggt acaagagctg cagaaaagca atcaggttgc cttccaacat 480 

ttccaagaac tagatgagca cattagctat gtagcaacta aagtctgtca ccttggagac 540 

cagttagagg gggtaaacac acccagacaa cgggcagtgg aggctcagaa attgatgaaa 600 

tactttaatg agtttctaga tggagaattg aaatctgatg tttttacaaa ttctgaaaag 660 

ataaaggaag cagcagacat cattcagaag ttgcacctaa ttgcccaaga gttacctttt 720 

gatagatttt cagaagttaa atccaaaatt gcaagtaaat accatgattt agaatgccag 78 0 

ctgattcagg agtttaccag tgctcaaaga agaggtgaaa tctccagaat gagagaagta 840 

gcagcagttt tacttcattt taagggttat tcccattgtg ttgatgttta tataaagcag 900 

tgccaggagg gtgcttattt gagaaatgat atatttgaag acgctggaat actctgtcaa 960 

agagtgaaca aacaagttgg agatatcttc agtaatccag aaacagtcct ggctaaactt 1020 

attcaaaatg tatttgaaat caaactacag agttttgtga aagagcagtt agaagaatgt 1080 

aggaagtccg atgcagagca atatctcaaa aatctctatg atctgtatac aagaaccacc 1140 

aatctttcca gcaagctgat ggagtttaat ttaggtactg ataaacagac tttcttgtct 12 00 

aagcttatca aatccatttt catttcctat ttggagaact atattgaggt ggagactgga 12 60 

tatttgaaaa gcagaagtgc tatgatccta cagcgctatt atgattcgaa aaaccatcaa 1320 

aagagatcca ttggcacagg aggtattcaa gatttgaagg aaagaattag acagcgtacc 1380 

aacttaccac ttgggccaag tatcgatact catggggaga cttttctatc ccaagaagtg 1440 

gtggttaatc ttttacaaga aaccaaacaa gcctttgaaa gatgtcatag gctctctgat 1500 

ccttctgact taccaaggaa tgccttcaga atttttacca ttcttgtgga atttttatgt 15 60 

attgagcata ttgattatgc tttggaaaca ggacttgctg gaattccctc ttcagattct 1620 

aggaatgcaa atctttattt tttggacgtt gtgcaacagg ccaatactat ttttcatctt 1680 

tttgacaaac agtttaatga tcaccttatg ccactaataa gctcttctcc taagttatct 1740 

gaatgccttc agaagaaaaa agaaataatt gaacaaatgg agatgaaatt ggatactggc 18 00 

attgatagga cattaaattg tatgattgga cagatgaagc atattttggc tgcagaacag 1860 

aagaaaacag attttaagcc agaagatgaa aacaatgttt tgattcaata tactaatgcc 1920 

tgtgtaaaag tctgtgctta cgtaagaaaa caagtggaga agattaaaaa ttccatggat 198 0 
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gggaagaatg tggatacagt 
gagcatcttc aacaatattc 
gccgaatata ggaagtgtgc 
actctgcatg ctctttgcaa 
tcaggagaac aacttgctaa 
gctgattata gatctgcccg 
att 



tttgatggaa cttggagtac 
ctacagttgt atgggtggca 
caaagacttc aagattccaa 
tcttctggta gttgccccag 
tctggacaag aatatacttc 
ccttgctcga cacttcagct 



gttttcatcg acttatctat 2040 
tgttggccat ttgtgatgta 2100 
tggtattaca tctttttgat 2160 
ataatttaaa gcaagtctgc 2220 
actccttcgt acaacttcgt 2280 
gagattgaat ttacaaagga 2340 
2343 



<210> 369 
<211> 708 
<212> PRT 

<213> Homo sapiens 



<400> 369 



Met 


Ala 


Thr 


Thr 


Ala 


Glu 


Leu 


Phe 


Glu 


Glu 


Pro 


Phe 


Val 


Ala 


Asp 


Glu 


1 








5 










10 










15 




Tyr 


He 


Glu 


Arg 


Leu 


Val 


Trp 


Arg 


Thr 


Pro 


Gly 


Gly 


Gly 


Ser 


Arg 


Gly 








20 










25 










30 






Gly 


Pro 


Glu 


Ala 


Phe 


Asp 


Pro 


Lys 


Arg 


Leu 


Leu 


Glu 


Glu 


Phe 


Val 


Asn 






35 










40 










45 








His 


He 


Gin 


Glu 


Leu 


Gin 


He 


Met 


Asp 


Glu 


Arg 


He 


Gin 


Arg 


Lys 


Val 




50 










55 










60 










Glu 


Lys 


Leu 


Glu 


Gin 


Gin 


Cys 


Gin 


Lys 


Glu 


Ala 


Lys 


Glu 


Phe 


Ala 


Lys 


65 










70 










75 










80 


Lys 


Val 


Gin 


Glu 


Leu 


Gin 


Lys 


Ser 


Asn 


Gin 


Val 


Ala 


Phe 


Gin 


His 


Phe 










85 










90 










95 




Gin 


Glu 


Leu 


Asp 


Glu 


His 


He 


Ser 


Tyr 


Val 


Ala 


Thr 


Lys 


Val 


Cys 


His 








100 










105 










110 






Leu 


Gly 


Asp 


Gin 


Leu 


Glu 


Gly 


Val 


Asn 


Thr 


Pro 


Arg 


Gin 


Arg 


Ala 


Val 






115 










120 










125 








Glu 


Ala 


Gin 


Lys 


Leu 


Met 


Lys 


Tyr 


Phe 


Asn 


Glu 


Phe 


Leu 


Asp 


Gly 


Glu 




130 










135 










140 










Leu 


Lys 


Sei- 


Asp 


Val 


Phe 


Thr 


Asn 


Ser 


Glu 


Lys 


He 


Lys 


Glu 


Ala 


Ala 


145 










150 










155 










160 


Asp 


He 


Ile 


Gin 


Lys 


Leu 


His 


Leu 


He 


Ala 


Gin 


Glu 


Leu 


Pro 


Phe 


Asp 










165 










170 










175 




Arg 


Phe 


Ser 


Glu 


Val 


Lys 


Ser 


Lys 


He 


Ala 


Ser 


Lys 


Tyr 


His 


Asp 


Leu 








180 










185 










190 






Glu 


Cys 


Gin 


Leu 


He 


Gin 


Glu 


Phe 


Thr 


Ser 


Ala 


Gin 


Arg 


Arg 


Gly 


Glu 






195 










200 










205 








He 


Ser 


Arg 


Met 


Arg 


Glu 


Val 


Ala 


Ala 


Val 


Leu 


Leu 


His 


Phe 


Lys 


Gly 




210 










215 










220 








Tyr 


Ser 


His 


Cys 


Val 


Asp 


Val 


Tyr 


He 


Lys 


Gin 


Cys 


Gin 


Glu 


Gly 


Ala 


225 










230 










235 










240 


Tyr 


Leu 


Arg 


Asn 


Asp 


He 


Phe 


Glu 


Asp 


Ala 


Gly 


He 


Leu 


Cys 


Gin 


Arg 










245 










250 










255 




Val 


Asn 


Lys 


Gin 


Val 


Gly 


Asp 


He 


Phe 


Ser 


Asn 


Pro 


Glu 


Thr 


Val 


Leu 








260 










265 










270 






Ala 


Lys 


Leu 


He 


Gin 


Asn 


Val 


Phe 


Glu 


He 


Lys 


Leu 


Gin 


Ser 


Phe 


Val 






275 










280 










285 








Lys 


Glu 


Gin 


Leu 


Glu 


Glu 


Cys 


Arg 


Lys 


Ser 


Asp 


Ala 


Glu 


Gin 


Tyr 


Leu 




290 










295 










300 










Lys 


Asn 


Leu 


Tyr 


Asp 


Leu 


Tyr 


Thr 


Arg 


Thr 


Thr 


Asn 


Leu 


Ser 


Ser 


Lys 


305 










310 










315 










320 


Leu 


Met 


Glu 


Phe 


Asn 


Leu 


Gly 


Thr 


Asp 


Lys 


Gin 


Thr 


Phe 


Leu 


Ser 


Lys 










325 










330 










335 




Leu 


He 


Lys 


Ser 


He 


Phe 


He 


Ser 


Tyr 


Leu 


Glu 


Asn 


Tyr 


He 


Glu 


Val 








340 










345 










350 
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Glu 


Thr 


Gly 


Tyr 


Leu 


Lys 


Ser 


Arg 


Ser 


Ala 


Met 


He 


Leu 


Gin 


Arg 


Tyr 






355 










360 










365 








Tyr 


Asp 


Ser 


Lys 


Asn 


His 


Gin 


Lys 


Arg 


Ser 


He 


Gly 


Thr 


Gly 


Gly 


He 




370 










375 










380 










Gin 


Asp 


Leu 


Lys 


Glu 


Arg 


He 


Arg 


Gin 


Arg 


Thr 




Leu 


Pro 


Leu 


Gly 


385 










390 










395 










400 


Pro 


Ser 


He 


Asp 


Thr 


His 


Gly 


Glu 


Thr 


Phe 


Leu 


Ser 


Gin 


Glu 


Val 


Val 










405 










410 










415 




Val 


Asn 


Leu 


Leu 


Gin 


Glu 


Thr 


Lys 


Gin 


Ala 


Phe 


Glu 


Arg 


Cys 


His 


Arg 








420 










425 










430 






Leu 


Ser 


Asp 


Pro 


Ser 


Asp 


Leu 


Pro 


Arg 


Asn 


Ala 


Phe 


Arg 


He 


Phe 


Thr 






435 










440 










445 








lie 


Leu 


Val 


Glu 


Phe 


Leu 


Cys 


He 


Glu 


His 


He 


Asp 


Tyr 


Ala 


Leu 


Glu 




450 










455 










460 










Thr 


Gly 


Leu 


Ala 


Gly 


He 


Pro 


Ser 


Ser 


Asp 


Ser 


Arg 


Asn 


Ala 


Asn 


Leu 


465 










470 










475 










480 


Tyr 


Phe 


Leu 


Asp 


Val 


Val 


Gin 


Gin 


Ala 


Asn 


Thr 


He 


Phe 


His 


Leu 


Phe 










485 










490 










495 




Asp 


Lys 


Gin 


Phe 


Asn 


Asp His 


Leu 


Met 


Pro 


Leu 


He 


Ser 


Ser 


Ser 


Pro 








500 










505 










510 






Lys 


Leu 


Ser 


Glu 


Cys 


Leu 


Gin 


Lys 


Lys 


Lys 


Glu 


He 


He 


Glu 


Gin 


Met 






515 










520 










525 








Glu 


Met 


Lys 


Leu 


Asp 


Thr Gly 


He 


Asp 


Ara 


Thr 


Leu 




Cys 


Met 


He 




530 










535 










540 










Gly 


Gin 


Met 


Lvs 


His 


He 


Leu 


Ala 


Ala 


Glu 


Gin 


Lys 


Lvs 


Thr 


Asp 


Phe 


545 










550 










555 




• 






560 


Lys 


Pro 


Glu 


Asp 


Glu 


Asn 


Asn 


Val 


Leu 


He 


Gin 


Tyr 


Thr 


Asn 


Ala 












565 










570 










575 




Val 


Lys 


Val 


Cys 


Ala 


Tyr Val 


Arg 


Lys 


Gin 


Val 


Glu 


Lys 


He 




Asn 








580 










585 










590 






Ser 


Met 


Asp 


Gly 


Lys 


Asn 


Val 


Asp 


Thr 


Val 


Leu 


Met 


Glu 


Leu 


Gly 


Val 






595 










600 










605 








Arg 


Phe 


His 


Arg 


Leu 


He 


Tyr 


Glu 


His 


Leu 


Gin 


Gin 


Tyr 


Ser 


Tyr 


Ser 




610 










615 










620 










Cys 


Met 


Gly 


Gly 


Met 


Leu 


Ala 


He 


Cys 


Asp 


Val 


Ala 


Glu 


Tyr 


Arg 


Lys 


625 










630 










635 










640 


Cys 


Ala 


Lys 


Asp 


Phe 


Lys 


He 


Pro 


Met 


Val 




His 


Leu 


Phe 


Asp 


Thr 










645 










650 










655 




Leu 


His 


Ala 


Leu 


Cys 


Asn 


Leu 


Leu 


Val 


Val 


Ala 


Pro 


Asp 


Asn 


Leu 


Lys 








660 










665 










670 






Gin 


Val 


Cys 


Ser 


Gly 


Glu 


Gin 


Leu 


Ala 


Asn 


Leu 


Asp 


Lys 


Asn 


He 


Leu 






675 










680 










685 








His 


Ser 


Phe 


Val 


Gin 


Leu Arg 


Ala 


Asp 


Tyr 


Arg 


Ser 


Ala 


Arg 


Leu 


Ala 




690 










695 










700 










Arg 


His 


Phe 


Ser 



























705 



<210> 370 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<400> 370 

gtcaatcact ctcccagcat aagcacccca gcccactcta ttccagggag tcatgctatg 60 



<210> 371 
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<211> 60 

<212> DNA 

<213> Homo sapiens 

<400> 371 

agtagaattt cctctggaac tggagacatt ttccagcaac atattcagct tgaaagtaca 60 



<210> 372 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<400> 372 

ccagagactg gagatcctgt tacgctgaga ctccttgatg atggagcagg tgctgatgtt 60 



<210> 373 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<400> 373 

ttacagtctg ctgtatctaa cattgcccag gcgcctctgt ttattccccc caattctgat 60 



<210> 374 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<400> 374 

gctgtgcccc cagccactgt ggaagccttt gtggaaagag acagcctcca ttttcctcat 60 



<210> 375 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<400> 375 

aaaaacacag tgactgtgga taatactgtg ggcaacgaca ctatgtttct agttacgtgg 60 



<210> 376 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 376 

Leu Gin Ser Ala Val Ser Asn lie Ala Gin Ala Pro Leu Phe lie Pro 

15 10 15 

Pro Asn Ser Asp 
20 



<210> 377 
<211> 20 
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<212> PRT 

<213> Homo sapiens 
<400> 377 

Val Asn His Ser Pro Ser He Ser Thr Pro Ala His Ser He Pro Gly 

15 10 15 

Ser His Ala Met 
20 



<210> 378 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 378 

Pro Glu Thr Gly Asp Pro Val Thr Leu Arg Leu Leu Asp Asp Gly Ala 

15 10 15 

Gly Ala Asp Val 
20 



<210> 379 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 379 

Ala Val Pro Pro Ala Thr Val Glu Ala Phe Val Glu Arg Asp Ser Leu 

15 10 15 

His Phe Pro His 
20 



<210> 380 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 380 

Ser Arg He Ser Ser Gly Thr Gly Asp He Phe Gin Gin His He Gin 

15 10 15 

Leu Glu Ser Thr 
20 



<210> 381 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 381 

Lys Asn Thr Val Thr Val Asp Asn Thr Val Gly Asn Asp Thr Met Phe 

15 10 15 

Leu Val Thr Trp 
20 
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<210> 382 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 382 

Lys Pro Gly His Trp Thr Tyr Thr Leu Asn Asn Thr His His Ser Leu 

15 10 15 

Gin Ala Leu Lys 
20 



<210> 383 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 383 

cggcgaattc atggattggg ggacgctgc 

<210> 384 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 384 

cggcctcgag tcacccctct atccgaacct tctgc 

<210> 385 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 385 

cggcgaattc cacgaaccac tcgcaagttc ag 

<210> 386 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 386 

cggctcgagt tagcttgggc ctgtgattgc 



<210> 387 
<211> 20 
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<212> PRT 

<213> Homo sapiens 
<400> 387 

Phe Phe Lys Trp Leu Leu Ser Cys Cys Pro Gly Ser Ser Gin lie Ala 

15 10 15 

Ala Ala Ala Ser 
20 



<210> 388 

<211> 19 

<212> PRT 

<213> Homo sapiens 

<400> 388 

Leu Ser Cys Cys Pro Gly Ser Ser Gin lie Ala Ala Ala Ser Thr Gin 

15 10 15 

Pro Glu Asp 



<210> 389 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 389 

Ala Ala Ala Ala Ser Thr Gin Pro Glu Asp Asp lie Asn Thr Gin Arg 

15 10 15 

Lys Lys Ser Gin 
20 



<210> 390 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 390 

Thr Gin Pro Glu Asp Asp lie Asn Thr Gin Arg Lys Lys Ser Gin Glu 

15 10 15 

Lys Met Arg Glu 
20 



<210> 391 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 391 

Asp lie Asn Thr Gin Arg Lys Lys Ser Gin Glu Lys Met Arg Glu Val 

15 10 15 

Thr Asp Ser Pro 
20 
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<210> 392 
<211> 20 
<212> BRT 

<213> Homo sapiens 
<400> 392 

Arg Lys Lys Ser Gin Glu Lys Met Arg Glu Val Thr Asp Ser Pro Gly 

15 10 15 

Arg Pro Arg Glu 
20 



<210> 393 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 393 

Glu Lys Met Arg Glu Val Thr Asp Ser Pro Gly Arg Pro Arg Glu Leu 

15 10 15 

Thr He Pro Gin 
20 



<210> 394 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 394 

Val Thr Asp Ser Pro Gly Arg Pro Arg Glu Leu Thr He Pro Gin Thr 

15 10 15 

Ser Ser His Gly 

20 



<210> 395 
<211> 19 
<212> PRT 

<213> Homo sapiens 
<400> 395 

Gly Arg Pro Arg Glu Leu Thr He Pro Gin Thr Ser Ser His Gly Ala 

15 10 15 

Asn Arg Phe 



<210> 396 
<211> 19 
<212> PRT 

<213> Homo sapiens 
<400> 396 

Met Asn Lys Leu Tyr He Gly Asn Leu Ser Glu Asn Ala Ala Pro Ser 

15 10 15 

Asp Leu Glu 
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<210> 397 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 397 

Ser Glu Asn Ala Ala Pro Ser Asp Leu Glu Ser lie Phe Lys Asp Ala 

1 5 10 15 

Lys He Pro Val 
20 



<210> 398 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 398 

Ser He Phe Lys Asp Ala Lys He Pro Val Ser Gly Pro Phe Leu Val 

15 10 15 

Lys Thr Gly Tyr 
20 



<210> 399 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 399 

Ser Gly Pro Phe Leu Val Lys Thr Gly Tyr Ala Phe Val Asp Cys Pro 

15 10 15 

Asp Glu Ser Trp 
20 



<210> 400 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 400 

Ala Phe Val Asp Cys Pro Asp Glu Ser Trp Ala Leu Lys Ala He Glu 

15 10 15 

Ala Leu Ser Gly 
20 



<210> 401 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 401 

Ala Leu Lys Ala He Glu Ala Leu Ser Gly Lys He Glu Leu His Gly 
15 10 15 
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Lys Pro lie Glu 
20 



<210> 402 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 402 

Lys He Glu Leu His Gly Lys Pro He Glu Val Glu His Ser Val Pro 

15 10 15 

Lys Arg Gin Arg 
20 



<210> 403 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 403 

Val Glu His Ser Val Pro Lys Arg Gin Arg He Arg Lys Leu Gin He 

15 10 15 

Arg Asn He Pro 
20 



<210> 404 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 404 

He Arg Lys Leu Gin He Arg Asn 

1 5 
Val Leu Asp Ser 
20 



He Pro Pro His Leu Gin Trp Glu 
10 15 



<210> 405 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 405 

Ala Val Val Asn Val Thr Tyr Ser Ser Lys Asp Gin Ala Arg Gin Ala 

15 10 15 

Leu Asp Lys Leu 
20 



<210> 406 
<211> 20 
<212> PRT 

<213> Homo sapiens 



<400> 406 
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Asp Gin Ala Arg Gin Ala Leu Asp Lys Leu Asn Gly Phe Gin Leu Glu 

1 5 10 15 

Asn Phe Thr Leu 



<210> 407 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 407 

Asn Gly Phe Gin Leu Glu Asn Phe Thr Leu Lys Val Ala Tyr lie Pro 

15 10 15 

Asp Glu Thr Ala 
20 



<210> 408 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 408 

Lys Val Ala Tyr He Pro Asp Glu Thr Ala Ala Gin Gin Asn Pro Leu 

15 10 15 

Gin Gin Pro Arg 
20 



<210> 409 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 409 

Ala Gin Gin Asn Pro Leu Gin Gin Pro Arg Gly Arg Arg Gly Leu Gly 

15 10 15 

Gin Arg Gly Ser 
20 



<210> 410 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 410 

Gly Arg Arg Gly Leu Gly Gin Arg Gly Ser Ser Arg Gin Gly Ser Pro 

15 10 15 

Gly Ser Val Ser 
20 



<210> 411 
<211> 20 
<212> PRT 

<213> Homo sapiens 
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<400> 411 

Ser Arg Gin Gly Ser Pro Gly Ser Val Ser Lys Gin Lys Pro Cys Asp 

1 5 10 15 

Leu Pro Leu Arg 
20 



<210> 412 
<211> 20 
<212> PRT 

<213> Homo sapiens 



<400> 412 

Lys Gin Lys Pro Cys Asp Leu Pro Leu Arg Leu Leu Val Pro Thr Gin 

1 5 10 15 

Phe Val Gly Ala 
20 



<210> 413 
<211> 20 
<212> PRT 

<213> Homo sapiens 



<400> 413 

Leu Leu Val Pro Thr Gin Phe Val Gly Ala lie lie Gly Lys Glu Gly 

15 10 15 

Ala Thr He Arg 
20 



<210> 414 
<211> 20 
<212> PRT 

<213> Homo sapiens 



<400> 414 

He He Gly Lys Glu Gly Ala Thr He Arg Asn He Thr Lys Gin Thr 

15 10 15 

Gin Ser Lys He 
20 



<210> 415 
<211> 20 
<212> PRT 
<213> Homo 



sapiens 



<400> 415 

Asn He Thr Lys Gin Thr Gin Ser Lys He Asp Val His Arg Lys Glu 

15 10 15 

Asn Ala Gly Ala 
20 



<210> 416 
<211> 20 
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<212> PRT 

<213> Homo sapiens 
<400> 416 

Asp Val His Arg Lys Glu Asn Ala Gly Ala Ala Glu Lys Ser lie Thr 

15 10 15 

lie Leu Ser Thr 
20 



<210> 417 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 417 

Ala Glu Lys Ser lie Thr He Leu 

1 5 
Ala Cys Lys Ser 
20 



Ser Thr Pro Glu Gly Thr Ser Ala 

10 ( 15 • 



<210> 418 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 418 

Pro Glu Gly Thr Ser Ala Ala Cys Lys Ser He Leu Glu He Met His 

15 10 15 

Lys Glu Ala Gin 
20 



<210> 419 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 419 

He Leu Glu He Met His Lys Glu Ala Gin Asp He Lys Phe Thr Glu 

15 10 15 

Glu He Pro Leu 
20 



<210> 420 
<211> 455 
<212> DNA 
<213> Homo sapiens 

<400> 420 

gaagacatgc ttacttcccc ttcaccttcc ttcatgatgt gggaagagtg ctgcaaccca 60 
gccctagcca acgccgcatg agagggagtg tgccgagggc ttctgagaag gtttctctca 12 0 
catctagaaa gaagcgctta agatgtggca gcccctcttc ttcaagtggc tcttgtcctg 180 
ttgccctggg agttctcaaa ttgctgcagc agcctccacc cagcctgagg atgacatcaa 24 0 
tacacagagg aagaagagtc aggaaaagat gagagaagtt acagactctc ctgggcgacc 300 
ccgagagctt accattcctc agacttcttc acatggtgct aacagatttg ttcctaaaag 360 
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taaagctcta gaggccgtca aattggcaat agaagccggg ttccaccata ttgattctgc 42 0 
acatgtttac aataatgagg agcaggttgg actgg 455 

<210> 421 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR primer 
<400> 421 

actagtgtcc gcgtggcggc ctac 24 

<210> 422 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 422 

catgagaatt catcacatgc ccttgaaggc tccc 34 

<210> 423 
<211> 161 
<212> PRT 

<213> Homo sapiens 



<400> 423 



Met 


Gin 


His His His 


His 


His 


His 


His 


Thr Ser Val Arg Val Ala 


Ala 


1 




5 










10 15 




Tyr 


Phe 


Glu Asn Phe 


Leu 


Ala 


Ala 


Trp 


Arg Pro Val Lys Ala Ser Asp 






20 








25 


30 




Gly 


Asp 


Tyr Tyr Thr 


Leu 


Ala 


Val 


Pro 


Met Gly Asp Val Pro Met 


Asp 






35 






40 




45 




Gly 


He 


Ser Val Ala 


Asp 


He 


Gly 


Ala 


Ala Val Ser Ser He Phe 


Asn 




50 






55 






60 




Ser 


Pro 


Glu Glu Phe 


Leu 


Gly 


Lys 


Ala 


Val Gly Leu Ser Ala Glu 


Ala 


65 






70 








75 


80 


Leu 


Thr 


He Gin Gin 


Tyr 


Ala 


Asp 


Val 


Leu Ser Lys Ala Leu Gly Lys 






85 










90 95 




Glu 


Val 


Arg Asp Ala 


Lys 


He 


Thr 


Pro 


Glu Ala Phe Glu Lys Leu 


Gly 






100 








105 


110 




Phe 


Pro 


Ala Ala Lys 


Glu 


He 


Ala 


Asn 


Met Cys Arg Phe Tyr Glu 


Met 






115 






120 




125 




Lys 


Pro 


Asp Arg Asp 


Val 


Asn 


Leu 


Thr 


His Gin Leu Asn Pro Lys 


Val 




130 






135 






140 






Ser 


Phe Ser Gin 


Phe 


He 


Ser 


Glu 


Asn Gin Gly Ala Phe Lys 


Gly 


145 






150 








155 


160 


Met 



















<210> 424 
<211> 489 
<212> DNA 



WO 02/47534 



178 



PCT7US01/47576 



<213> Homo sapiens 
<400> 424 

atgcagcatc accaccatca ccaccacact agtgtccgcg tggcggccta ctttgaaaac 60 
tttctcgcgg cgtggcggcc cgtgaaagcc tctgatggag attactacac cttggctgta 12 0 
ccgatgggag atgtaccaat ggatggtatc tctgttgctg atattggagc agccgtctct 180 
agcattttta attctccaga ggaattttta ggcaaggccg tggggctcag tgcagaagca 240 
ctaacaatac agcaatatgc tgatgttttg tccaaggctt tggggaaaga agtccgagat 300 
gcaaagatta ccccggaagc tttcgagaag ctgggattcc ctgcagcaaa ggaaatagcc 360 
aatatgtgtc gtttctatga aatgaagcca gaccgagatg tcaatctcac ccaccaacta 42 0 
aatcccaaag tcaaaagctt cagccagttt atctcagaga accagggagc cttcaagggc 480 
atgtgatga 489 

<210> 425 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 425 

aacaaactgt atatcggaaa cctcagcgag aa 32 

<210> 426 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 426 

ccatagaatt cattacttcc gtcttgactg agg 33 

<210> 427 
<211> 586 
<212> PRT 

<213> Homo sapiens 
<400> 427 



Met 


Gin 


His 


His 


His 


His 


His 


His 


Asn 


Lys 


Leu Tyr He Gly Asn 


Leu 


1 








5 










10 


15 




Ser 


Glu 


Asn 


Ala 


Ala 


Pro 


Ser 


Asp 


Leu 


Glu 


Ser He Phe Lys Asp 


Ala 








20 










25 




30 




Lys 


He 


Pro 


Val 


Ser 


Gly 


Pro 


Phe 


Leu 


Val 


Lys Thr Gly Tyr Ala 


Phe 






35 










40 






45 




Val 


Asp 


Cys 


Pro 


Asp' 


Glu 


Ser 


Trp 


Ala 


Leu 


Lys Ala He Glu Ala 


Leu 




50 










55 








60 




Ser 


Gly 


Lys 


He 


Glu 




His 


Gly 


Lys 


Pro 


He Glu Val Glu His 


Ser 


65 










70 










75 


80 


Val 


Pro 


Lys 


Arg 


Gin 


Arg 


He 


Arg 


Lys 


Leu 


Gin He Arg Asn He 


Pro 










85 










90 


95 




Pro 


His 


Leu 


Gin 


Trp 


Glu 


Val 


Leu 


Asp 


Ser 


Leu Leu Val Gin Tyr 


Gly 








100 










105 




110 




Val 


Val 


Glu 


Ser 


Cys 


Glu 


Gin 


Val 


Asn 


Thr 


Asp Ser Glu Thr Ala 


Val 






115 










120 






125 




Val 


Asn 


Val 


Thr 


Tyr 


Ser 


Ser 


Lys 


Asp 


Gin Ala Arg Gin Ala Leu Asp 
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130 135 



Lys 


Leu 


Asn 


Gly 


Phe 


Gin 


Leu 


Glu 


145 










150 






He 


Pro 


Asp 


Glu 


Thr 


Ala 


Ala 


Gin 










155 








Gly 


Arg 


Arg 


Gly Leu 


Gly 


Gin 


Arg 








180 










Gly 


Ser 


Val 


Ser 




Gin 




Pro 






195 










200 


Val 


Pro 


Thr 


Gin 


Phe 


Val 


Gly 


Ala 




210 










215 




He 


Arg 


Asn 


He 


Thr 


Lys 


Gin 


Thr 


225 










230 








Glu 


Asn 


Ala Gly Ala Ala 


Glu 










245 








Pro 


Glu 


Gly 


Thr 


Ser 


Ala 


Ala 


Cys 








260 










Lys 


Glu 


Ala 


Gin 


Asp 


He 


Lys 


Phe 






275 










280 


Leu 


Ala 


His 


Asn 


Asn 


Phe 


Val 


Gly 




290 










295 




Asn 


Leu 


Lys 


Lys 


lie 


Glu 


Gin 


Asp 


305 










310 






Pro 


Leu 


Gin 


Glu 


Leu 


Thr 


Leu 


Tyr 










325 








Lys 


Gly 


Asn 


Val 


Glu 


Thr 


Cys 


Ala 








340 










Lys 


He 


Arg 


Glu 


Ser 


Tyr 


Glu 


Asn 






355 










360 


Ala 


His 


Leu 


He Pro Gly Leu 


Asn 




370 










375 




Pro 


Thr 


Ser 


Gly Met 


Pro 


Pro 


Pro 


385 










390 






Thr 


Pro 


Pro 


Tyr 


Pro 


Gin 


Phe 


Glu 










405 








Leu 


Phe 


He 


Pro 


Ala 


Leu 


Ser 


Val 








420 










Gin 


His 


He 


Lys 


Gin 




Ser 


Arg 






435 










440 


Ala 


Pro 


Ala 


Glu 


Ala 


Pro 


Asp 


Ala 




450 










455 




Gly 


Pro 


Pro 


Glu Ala 


Gin 


Phe 


Lys 


465 










470 






He 


Lys 


Glu 


Glu 


Asn 


Phe 


Val 


Ser 










485 








Ala 


His 


He 


Arg 


Val 


Pro 


Ser 


Phe 








500 










Gly 


Gly 


Lys 


Thr 


Val 


Asn 


Glu 


Leu 






515 










520 


Val 


Val 


Pro 


Arg 


Asp 


Gin 


Thr 


Pro 




530 










535 




Lys 


He 


Thr 


Gly His 


Phe 


Tyr 


Ala 


545 










550 






Gin 


Glu 


He 


Leu 


Thr 


Gin 


Val 


Lys 










565 








Gin 


Ser 


Gly 


Pro 


Pro 


Gin 


Ser 


Arg 



580 



140 



Asn 


Phe 


Thr 
155 


Leu 


Lys 


Val 


Ala 


Tyr 
160 


Gin 


Asn 
170 


Pro 


Leu 


Gin 


Gin 


Pro 
175 


Arg 


Gly 


Ser 


Ser 


Arg 


Gin 


Gly 


Ser 


Pro 


185 










190 






Cys 


Asp 


Leu 


Pro 


Leu 
205 


Arg 


Leu 


Leu 


He 


He 


Gly 


Lys 
220 


Glu 


Gly 


Ala 


Thr 


Gin 


Ser 


Lys 

235 


He 


Asp 


Val 


His 


Arg 
240 


Lys 


Ser 

250 


He 


Thr 


He 


Leu 


Ser 
255 


Thr 


Lys 


Ser 


He 


Leu 


Glu 


He 


Met 


His 


265 










270 






Thr 


Glu 


Glu 


He 


Pro 
285 


Leu 


Lys 


He 


Arg 


Leu 


He 


Gly 
300 


Lys 


Glu 


Gly 


Arg 


Thr 


Asp 


Thr 
315 


Lys 


He 


Thr 


He 


Ser 
320 


Asn 


Pro 
330 


Glu 


Arg 


Thr 


He 


Thr 
335 


Val 


Lys 


Ala 


Glu 


Glu 


Glu 


He 


Met 


Lys 


345 










350 








He 


Ala 


Ser 


Met 
365 


Asn 


Leu 


Gin 


Leu 


Asn 


Ala 


Leu 
380 


Gly 


Leu 


Phe 


Pro 


Thr 


Ser 


Gly 
395 


Pro 


Pro 


Ser 


Ala 


Met 
400 


Gin 


Ser 
410 


Glu 


Thr 


Glu 


Thr 


Val 
415 


His 


Gly 


Ala 


He 


He 


Gly 


Lys 


Gin 


Gly 


425 










430 






Phe 


Ala 


Gly 


Ala 


Ser 
445 


He 


Lys 


He 


Lys 


Val 


Arg 


Met 
460 


Val 


He 


He 


Thr 


Ala 


Gin 


Gly 
475 


Arg 


He 


Tyr 


Gly 


Lys 
480 


Pro 


490 


Glu 


Glu 


Val 


Lys 


Leu 
495 


Glu 


Ala 


Ala 


Gly 


Arg 


Val 


He 


Gly 


Lys 


505 










510 






Gin 


Asn 


Leu 


Ser 


Ser 
525 


Ala 


Glu 


Val 


Asp 


Glu 


Asn 


540 


Gin 


Val 


Val 


Val 


Cys 


Gin 


Val 
555 


Ala 


Gin 


Arg 


Lys 


He 

560 


Gin 


His 
570 


Gin 


Gin 


Gin 


Lys 


Ala 
575 


Leu 



Arg Lys 
585 
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<210> 428 
<211> 1764 
<212> DNA 

<213> Homo sapiens 
<400> 428 

atgcagcatc accaccatca ccacaacaaa ctgtatatcg gaaacctcag cgagaacgcc 60 
gccccctcgg acctagaaag tatcttcaag gacgccaaga tcccggtgtc gggacccttc 120 
ctggtgaaga ctggctacgc gttcgtggac tgcccggacg agagctgggc cctcaaggcc 180 
atcgaggcgc tttcaggtaa aatagaactg cacgggaaac ccatagaagt tgagcactcg 24 0 
gtcccaaaaa ggcaaaggat tcggaaactt cagatacgaa atatcccgcc tcatttacag 300 
tgggaggtgc tggatagttt actagtccag tatggagtgg tggagagctg tgagcaagtg 360 
aacactgac.t cggaaactgc agttgtaaat gtaacctatt ccagtaagga ccaagctaga 420 
caagcactag acaaactgaa tggatttcag ttagagaatt tcaccttgaa agtagcctat 480 
atccctgatg aaacggccgc ccagcaaaac cccttgcagc agccccgagg tcgccggggg 54 0 
cttgggcaga ggggctcctc aaggcagggg tctccaggat ccgtatccaa gcagaaacca 600 
tgtgatttgc ctctgcgcct gctggttccc acccaatttg ttggagccat cataggaaaa 660 
gaaggtgcca ccattcggaa catcaccaaa cagacccagt ctaaaatcga tgtccaccgt 720 
aaagaaaatg cgggggctgc tgagaagtcg attactatcc tctctactcc tgaaggcacc 780 
tctgcggctt gtaagtctat tctggagatt atgcataagg aagctcaaga tataaaattc 840 
acagaagaga tccccttgaa gattttagct cataataact ttgttggacg tcttattggt 900 
aaagaaggaa gaaatcttaa aaaaattgag caagacacag acactaaaat cacgatatct 960 
ccattgcagg aattgacgct gtataatcca gaacgcacta ttacagttaa aggcaatgtt 1020 
gagacatgtg ccaaagctga ggaggagatc atgaagaaaa tcagggagtc ttatgaaaat 1080 
gatattgctt ctatgaatct tcaagcacat ttaattcctg gattaaatct gaacgccttg 114 0 
ggtctgttcc cacccacttc agggatgcca cctcccacct cagggccccc ttcagccatg 1200 
actcctccct acccgcagtt tgagcaatca gaaacggaga ctgttcatct gtttatccca 1260 
gctctatcag tcggtgccat catcggcaag cagggccagc acatcaagca gctttctcgc 1320 
tttgctggag cttcaattaa gattgctcca gcggaagcac cagatgctaa agtgaggatg 1380 
gtgattatca ctggaccacc agaggctcag ttcaaggctc agggaagaat ttatggaaaa 14 4 0 
attaaagaag aaaactttgt tagtcctaaa gaagaggtga aacttgaagc tcatatcaga 1500 
gtgccatcct ttgctgctgg cagagttatt ggaaaaggag gcaaaacggt gaatgaactt 1560 
cagaatttgt caagtgcaga agttgttgtc cctcgtgacc agacacctga tgagaatgac 1620 
caagtggttg tcaaaataac tggtcacttc tatgcttgcc aggttgccca gagaaaaatt 1680 
caggaaattc tgact'caggt aaagcagcac caacaacaga aggctctgca aagtggacca 17 4 0 
cctcagtcaa gacggaagta atga 17 64 

<210> 429 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 429 

ccatggaatt cattatttca atataagata atctc 35 

<210> 430 
<211> 881 
<212> PRT 

<213> Homo sapiens 
<400> 430 

Met Gin His His His His His His Gly Val Gin Leu Gin Asp Asn Gly 

15 10 15 

Tyr Asn Gly Leu Leu lie Ala He Asn Pro Gin Val Pro Glu Asn Gin 



WO 02/47534 



181 



PCT7US01/47576 









20 










25 










30 






Asn 


Leu 


He 




Asn 


He 


Lys 


Glu 


Met 


He 


Thr 


Glu 


Ala 


Ser 


Phe 


Tyr 






35 










40 










45 








Leu 


Phe 




Ala 


Thr 


Lys 


Arg Arg Val 


Phe 


Phe 


Arg 




lie 


Lys 


He 




50 










55 










60 










Leu 


He 


Pro 


Ala 


Thr 


Trp 


Lys 


Ala 


Asn 


Asn 


Asn 


Ser 


Lys 


He 


Lys 


Gin 


65 










70 










75 










80 


Glu 


Ser 


Tyr 


Glu 




Ala 


Asn 


Val 


He 


Val 


Thr 


Asp 


Trp 


Tyr 


Gly 


Ala 










85 










90 










95 




His 


Gly 


Asp 


Asp 


Pro 


Tyr Thr Leu 


Gin 


Tyr 


Arg 


Gly 


Cys 


Gly 


Lys 


Glu 








100 










105 










110 






Gly 


Lys 


Tyr 


He 


His 


Phe 


Thr 


Pro 


Asn 


Phe 


Leu 


Leu 


Asn 


Asp 


Asn 


Leu 






115 










120 










125 








Thr 


Ala 


Gly 


Tyr 


Gly Ser Arg Gly Arg 


Val 


Phe 


Val 


His 


Glu 


Trp Ala 




130 










135 










140 










His 


Leu 


Arg 


Trp 


Gly Val 


Phe Asp 


Glu 


Tyr 


Asn 




Asp 


Lys 


Pro 


Phe 


145 










150 










155 










160 


Tyr 


He 


Asn 


Gly 


Gin 


Asn 


Gin 


He 


Lys 


Val 


Thr 


Arg 


Cys 


Ser 


Ser 












165 










170 










175 




He 


Thr 


Gly 


He 


Phe 


Val 


C 


Glu 


Lys 


Gly 


Pro 


Cys 


Pro 


Gin 


Glu 


Asn 








180 










185 










190 






Cys 


He 


He 


Ser 


Lys 


Leu 


Phe 


Lys 


Glu 


Gly 


Cys 


Thr 


Phe 


He 


Tyr Asn 






195 










200 










205 








Ser 


Thr 


Gin 


Asn 


Ala 


Thr 


Ala 


Ser 


He 


Met 


Phe 


Met 


Gin 


Ser 


Leu 


Ser 




210 










215 










220 










Ser 


Val 


Val 


Glu 


Phe 


Cys 




Ala 


Ser 


Thr 


His 


Asn 


Gin 


Glu 


Ala 


Pro 


225 










230 










235 










240 


Asn 


Leu 


Gin 


Asn 


Gin 


Met 


Cys 


Ser 


Leu 


Arg 


Ser 


Ala 


Trp 


Asp 


Val 


He 










245 










250 










255 




Thr 


Asp 


Ser 


Ala 


Asp 


Phe 


His 


His 


Ser 


Phe 


Pro 


Met 


Asn 


Gly 


Thr 


Glu 








260 










265 










270 






Leu 


Pro 


Pro 


Pro 


Pro 


Thr 


Phe 


Ser 


Leu 


Val 


Glu 


Ala 


Gly 


Asp 


Lys 


Val 






275 










280 










285 








Val 


Cys 


Leu 


Val 


Leu 


Asp 


Val 


Ser 


Ser 


Lys 


Met 


Ala 


Glu 


Ala 


Asp 


Arg 




290 










295 










300 










Leu 


Leu 


Gin 


Leu 


Gin 


Gin 


Ala 


Ala 


Glu 


Phe 


Tyr 


Leu 


Met 


Gin 


He 


Val 


305 










310 










315 










320 


Glu 


He 


His 


Thr 


Phe 


Val 


Gly 


He 


Ala 


Ser 


Phe 


Asp 


Ser 


Lys 


Gly 


Glu 










325 










330 










335 




He 


Arg 


Ala 


Gin 


Leu 


His 


Gin 


He 


Asn 


Ser 


Asn 


Asp 


Asp 


Arg 


Lys 


Leu 








340 










345 










350 






Leu 


Val 


Ser 


Tyr 


Leu 


Pro 


Thr 


Thr 


Val 


Ser 


Ala 


Lys 


Thr 


Asp 


He 


Ser 






355 










360 










365 








He 


Cys 


Ser 


Gly 


Leu 


Lys 


Lys 


Gly 


Phe 


Glu 


Val 


Val 


Glu 


Lys 


Leu 


Asn 




370 










375 










380 










Gly 


Lys 


Ala 


Tyr 


Gly 


Ser 


Val 


Met 


lie 


Leu 


Val 


Thr 


Ser 


Gly 


Asp 


Asp 


385 










390 










395 










400 


Lys 


Leu 


Leu 


Gly 


Asn 


Cys 




Pro 


Thr 


Val 


Leu 


Ser 


Ser 


Gly 


Ser 


Thr 










405 










410 










415 




He 


His 


Ser 


He 


Ala 


Leu 


Gly 


Ser 


Ser 


Ala 


Ala 


Pro 


Asn 


Leu 


Glu 


Glu 








420 










425 










430 






Leu 


Ser 


Arg 


Leu 


Thr Gly Gly Leu Lys 


Phe 


Phe 


Val 


Pro 


Asp 


He 


Ser 






435 










440 










445 








Asn 


Ser 


Asn 


Ser 


Met 


He Asp Ala 


Phe 


Ser 


Arg 


He 


Ser 


Ser 


Gly 


Thr 




450 










455 










460 










Gly 


Asp 


lie 


Phe 


Gin 


Gin 


His 


He 


Gin 


Leu 


Glu 


Ser 


Thr 


Gly 


Glu 


Asn 


465 










470 










475 










480 


Val 


Lys 


Pro 


His 


His 


Gin 


Leu 


Lys 


Asn 


Thr 


Val 


Thr 


Val 


Asp 


Asn 


Thr 
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485 



Val 


Gly 


Asn 


Asp 


Thr 


Met 


Phe 


Leu 








500 










Pro 


Glu 


lie 


He 


Leu 


Phe 


Asp 


Pro 






515 










520 


Asn 


Phe 


He 


Thr 


Asn 


Leu 


Thr 


Phe 




530 










535 




Gly 


Thr 


Ala 


Lys 


Pro 


Gly 


His 


Trp 


545 










550 






His 


Ser 


Leu 


Gin 


Ala 


Leu 


Lys 


Val 










565 








Ser 


Ala 


Val 


Pro 


Pro 


Ala 


Thr 


Val 








580 










Leu 


His 


Phe 


Pro 


His 


Pro 


Val 


Met 






595 










600 


Phe 


Tyr 


Pro 


He 


Leu 


Asn 


Ala 


Thr 




610 










615 




Thr 


Gly 


Asp 


Pro 


Val 


Thr 




Arg 


625 










630 






Asp 


Val 


He 


Lys 


Asn 


Asp 


Gly 


He 










645 








Ala 


Ala 


Asn 


Gly Arg 


Tyr 


Ser 


Leu 








660 










Ser 


lie 


Ser 


Thr 


Pro 


Ala 


His 


Ser 






675 










680 


Val 


Pro 


Gly Tyr Thr 


Ala 


Asn 


Gly 




690 










695 




Lys 


Ser 


Val 


Gly Arg 


Asn 


Glu 


Glu 


705 










'710 






Val 


Ser 


Ser Gly Gly 


Ser 


Phe 


Ser 










725 








His 


Pro 


Asp 


Val 


Phe 


Pro 


Pro 


Cys 








740 










Lys 


Val 


Glu 


Glu 


Glu 


Leu 


Thr 


Leu 






755 










760 


Phe 


Asp 


Gin Gly Gin 


Ala 


Thr 


Ser 




770 










775 




Leu 


Gin 


Asn 


He 


Gin 


Asp 


Asp 


Phe 


785 










790 






Ser 


Lys 


Arg Asn 


Pro 


Gin 


Gin 


Ala 










805 








Ser 


Pro 


Gin 


He 


Ser 


Thr 


Asn 


Gly 








820 










Thr 


His 


Glu 


Ser 


His 


Arg 


He 


Tyr 






835 










840 


Asn 


Ser 


Leu 


Gin 


Ser 


Ala 


Val 


Ser 




850 










855 




lie 


Pro 


Pro 


Asn 


Ser 


Asp 


Pro 


Val 


865 










870 







Ly £ 





490 










495 




Val 


Thr 


Trp 


Gin 


Ala 


Ser 


Gly 


Pro 


505 










510 






Asp 


Gly 


Arg 


Lys 


Tyr 


Tyr 


Thr 


Asn 










525 








Arg 


Thr 


Ala 


Ser 


Leu 


Trp 


He 


Pro 








540 










Thr 


Tyr 


Thr 


Leu 


Asn 


Asn 


Thr 


His 






555 










560 


Thr 


Val 


Thr 


Ser 


Arg 


Ala 


Ser 


Asn 




570 










575 




Glu 


Ala 


Phe 


Val 


Glu 


Arg 


Asp 


Ser 


585 










590 






He 


Tyr 


Ala 


Asn 


Val 


Lys 


Gin 


Gly 










605 








Val 


Thr 


Ala 


Thr 


Val 


Glu 


Pro 


Glu 








620 










Leu 


Leu 


Asp 


Asp 


Gly 


Ala 


Gly 


Ala 






635 










640 


Tyr 


Ser 


Arg 


Tyr 


Phe 


Phe 


Ser 


Phe 




650 










655 




Lys 


Val 


His 


Val 


Asn 


His 


Ser 


Pro 


665 










670 






He 


Pro 


Gly 


Ser 


His 


Ala 


Met 


Tyr 










685 








Asn 


He 


Gin 


Met 


Asn 


Ala 


Pro 


Arg 








700 










Glu 


Arg 


Lys 


Trp 


Gly 


Phe 


Ser 


Arg 






715 










720 


Val 


Leu 


Gly 


Val 


Pro 


Ala 


Gly 


Pro' 




730 










735 




Lys 


He 


He 


Asp 


Leu 


Glu 


Ala 


Val 


745 










750 






Ser 


Trp 


Thr 


Ala 


Pro 


Gly 


Glu 


Asp 










765 








Tyr 


Glu 


He 


Arg 
780 


Met 


Ser 


Lys 


Ser 


Asn 


Asn 


Ala 


He 


Leu 




Asn 








795 










800 


Gly 


He 


Arg 


Glu 


He 


Phe 


Thr 


Phe 




810 










815 




Pro 


Glu 


His 


Gin 


Pro 


Asn 


Gly 


Glu 


825 










830 






Val 


Ala 


He 


Arg 


Ala 
845 


Met 


Asp 


Arg 


Asn 


He 


Ala 


Gin 


Ala 


Pro 


Leu 


Phe 








860 










Pro 


Ala 


Arg 


Asp 


Tyr 


Leu 


He 


Leu 






875 










880 



<210> 431 

<211> 2646 

<212> DNA 

<213> Homo sapiens 



WO 02/47534 



183 



PCT7US01/47576 



<400> 431 

atgcagcatc accaccatca ccacggagta cagcttcaag acaatgggta taatggattg 60 
ctcattgcaa ttaatcctca ggtacctgag aatcagaacc tcatctcaaa cattaaggaa 12 0 
atgataactg aagcttcatt ttacctattt aatgctacca agagaagagt atttttcaga 180 
aatataaaga ttttaatacc tgccacatgg aaagctaata ataacagcaa aataaaacaa 24 0 
gaatcatatg aaaaggcaaa tgtcatagtg actgactggt atggggcaca tggagatgat 300 
ccatacaccc tacaatacag agggtgtgga aaagagggaa aatacattca tttcacacct 360 
aatttcctac tgaatgataa cttaacagct ggctacggat cacgaggccg agtgtttgtc 42 0 
catgaatggg cccacctccg ttggggtgtg ttcgatgagt ataacaatga caaacctttc 480 
tacataaatg ggcaaaatca aattaaagtg acaaggtgtt catctgacat cacaggcatt 540 
tttgtgtgtg aaaaaggtcc ttgcccccaa gaaaactgta ttattagtaa gctttttaaa 600 
gaaggatgca cctttatcta caatagcacc caaaatgcaa ctgcatcaat aatgttcatg 660 
caaagtttat cttctgtggt tgaattttgt aatgcaagta cccacaacca agaagcacca 72 0 
aacctacaga accagatgtg cagcctcaga agtgcatggg atgtaatcac agactctgct 780 
gactttcacc acagctttcc catgaacggg actgagcttc cacctcctcc cacattctcg 840 
cttgtagagg ctggtgacaa agtggtctgt ttagtgctgg atgtgtccag caagatggca 900 
gaggctgaca gactccttca actacaacaa gccgcagaat tttatttgat gcagattgtt 960 
gaaattcata ccttcgtggg cattgccagt ttcgacagca aaggagagat cagagcccag 1020 
ctacaccaaa ttaacagcaa tgatgatcga aagttgctgg tttcatatct gcccaccact 1080 
gtatcagcta aaacagacat cagcatttgt tcagggctta agaaaggatt tgaggtggtt 114 0 
gaaaaactga atggaaaagc ttatggctct gtgatgatat tagtgaccag cggagatgat 12 00 
aagcttcttg gcaattgctt acccactgtg ctcagcagtg gttcaacaat tcactccatt 12 60 
gccctgggtt catctgcagc cccaaatctg gaggaattat cacgtcttac aggaggttta 1320 
aagttctttg ttccagatat atcaaactcc aatagcatga ttgatgcttt cagtagaatt 1380 
tcctctggaa ctggagacat tttccagcaa catattcagc ttgaaagtac aggtgaaaat 1440 
gtcaaacctc accatcaatt gaaaaacaca gtgactgtgg ataatactgt gggcaacgac 1500 
actatgtttc tagttacgtg gcaggccagt ggtcctcctg agattatatt atttgatcct 1560 
gatggacgaa aatactacac aaataatttt atcaccaatc taacttttcg gacagctagt 1620 
ctttggattc caggaacagc taagcctggg cactggactt acaccctgaa caatacccat 1680 
cattctctgc aagccctgaa agtgacagtg acctctcgcg cctccaactc agctgtgccc 1740 
ccagccactg tggaagcctt tgtggaaaga gacagcctcc attttcctca tcctgtgatg 1800 
atttatgcca atgtgaaaca gggattttat cccattctta atgccactgt cactgccaca 18 60 
gttgagccag agactggaga tcctgttacg ctgagactcc ttgatgatgg agcaggtgct 1920 
gatgttataa aaaatgatgg aatttactcg aggtattttt tctcctttgc tgcaaatggt 1980 
agatatagct tgaaagtgca tgtcaatcac tctcccagca taagcacccc agcccactct 2040 
attccaggga gtcatgctat gtatgtacca ggttacacag caaacggtaa tattcagatg 2100 
aatgctccaa ggaaatcagt aggcagaaat gaggaggagc gaaagtgggg ctttagccga 2160 
gtcagctcag gaggctcctt ttcagtgctg ggagttccag ctggccccca ccctgatgtg 2220 
tttccaccat gcaaaattat tgacctggaa gctgtaaaag tagaagagga attgacccta 2280 
tcttggacag cacctggaga agactttgat cagggccagg ctacaagcta tgaaataaga 2340 
atgagtaaaa gtctacagaa tatccaagat gactttaaca atgctatttt agtaaataca 2400 
tcaaagcgaa atcctcagca agctggcatc agggagatat ttacgttctc accccaaatt 24 60 
tccacgaatg gacctgaaca tcagccaaat ggagaaacac atgaaagcca cagaatttat 2520 
gttgcaatac gagcaatgga taggaactcc ttacagtctg ctgtatctaa cattgcccag 2580 
gcgcctctgt ttattccccc caattctgat cctgtacctg ccagagatta tcttatattg 2640 
aaataa . 2646 

<210> 432 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 432 

cgcctgctcg agtcattaat attcatcaga aaatgg 36 
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<210> 433 
<211> 371 
<212> PRT 

<213> Homo sapiens 



<400> 433 



Met 


Gin 


His 


His 


His 


His 


His His Trp Gin 


Pro Leu 


Phe 


Phe 


Lys 


Trp 


1 








5 




10 








15 




Leu 


Leu 


Ser 


Cys 


Cys 


Pro 


Gly Ser Ser Gin 


He Ala 


Ala 


Ala 


Ala 


Ser 








20 






25 






30 






Thr 


Gin 


Pro 


Glu 


Asp 


Asp 


He Asn Thr Gin 


Arg Lys 


Lys 


Ser 


Gin 


Glu 






35 








40 




45 








Lys 


Met 


Arg 


Glu 


Val 


Thr 


Asp Ser Pro Gly Arg Pro 


Arg 


Glu 


Leu 


Thr 




50 










55 


60 










He 


Pro 


Gin 


Thr 


Ser 


Ser His Gly Ala Asn 


Arg Phe 


Val 


Pro 


Lys 


Ser 


65 










70 




75 








80 


Lys 


Ala 


Leu 


Glu 


Ala 


Val 


Lys Leu Ala He 


Glu Ala 


Gly 


Phe 


His 


His 










85 




90 








95 




He 


Asp 


Ser 


Ala 


His 


Val 


Tyr Asn Asn Glu 


Glu Gin 


Val 


Gly 


Leu 


Ala 








100 






105 






110 






He 


Arg 


Ser 


Lys 


He 


Ala Asp Gly Ser Val Lys Arg 


Glu Asp 


He 


Phe 






115 








120 




125 








Tyr 


Thr 


Ser 


Lys 


Leu 


Trp Ser Asn Ser His Arg Pro 


Glu 


Leu 


Val 


Arg 




130 










135 


140 










Pro 


Ala 


Leu 


Glu 


Arg 


Ser 


Leu Lys Asn Leu 


Gin Leu 


Asp 


Tyr 


Val 


Asp 


145 










150 




155 








160 


Leu 


Tyr 


Leu 


He 


His 


Phe 


Pro Val Ser Val 


Lys Pro 


Gly 


Glu 


Glu 


Val 










165 




170 








175 




He 


Pro 


Lys 


Asp 


Glu 


Asn 


Gly Lys He Leu 


Phe Asp 


Thr 


Val 


Asp 


Leu 








180 






185 






190 






Cys 


Ala 


Thr 


Trp 


Glu 


Ala 


Met Glu Lys Cys 


Lys Asp 


Ala Gly Leu Ala 






195 








200 




205 








Lys 


Ser 


He 


Gly 


Val 


Ser 


Asn Phe Asn His 


Arg Leu 


Leu 


Glu 


Met 


He 




210 










215 


220 










Leu 


Asn 


Lys 


Pro 


Gly 


Leu 


Lys Tyr Lys Pro 


Val Cys 


Asn 


Gin 


Val 


Glu 


225 










230 




235 








240 


Cys 


His 


Pro 


Tyr 


Phe 


Asn 


Gin Arg Lys Leu 


Leu Asp 


Phe 


Cys 


Lys 


Ser 










245 




250 








255 




Lys 


Asp 


He 


Val 


Leu 


Val Ala Tyr Ser' Ala Leu Gly 


Ser 


His 


Arg 


Glu 








260 






265 






270 






Glu 


Pro 


Trp 


Val 


Asp 


Pro 


Asn Ser Pro Val 


Leu Leu 


Glu Asp 


Pro 


Val 






275 








280 




285 








Leu 


Cys 


Ala 


Leu 


Ala 


Lys 


Lys His Lys Arg 


Thr Pro 


Ala 


Leu 


He 


Ala 




290 










2 95 


300 










Leu 


Arg 


Tyr 


Gin 




Gin 


Arg Gly Val Val 


Val Leu 


Ala 


Lys 


Ser 


Tyr 


305 










310 




315 








320 


Asn 


Glu 


Gin 


Arg 


He 


Arg 


Gin Asn Val Gin 


Val Phe 


Glu 


Phe 


Gin 


Leu 










325 




330 








335 




Thr 


Ser 


Glu 


Glu 


Met 


Lys 


Ala He Asp Gly Leu Asn 


Arg Asn 


Val 


Arg 








340 






345 






350 






Tyr 


Leu 


Thr 


Leu 


Asp 


He 


Phe Ala Gly Pro 


Pro Asn 


Tyr 


Pro 


Phe 


Ser 






355 








360 




365 








Asp 


Glu 


Tyr 






















370 























<210> 434 
<211> 1119 
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<212> DNA 

<213> Homo sapiens 

<400> 434 

atgcagcatc accaccatca ccactggcag cccctcttct tcaagtggct cttgtcctgt 60 

tgccctggga gttctcaaat tgctgcagca gcctccaccc agcctgagga tgacatcaat 120 

acacagagga agaagagtca ggaaaagatg agagaagtta cagactctcc tgggcgaccc 180 

cgagagctta ccattcctca gacttcttca catggtgcta acagatttgt tcctaaaagt 240 

aaagctctag aggccgtcaa attggcaata gaagccgggt tccaccatat tgattctgca 300 

catgtttaca ataatgagga gcaggttgga ctggccatcc gaagcaagat tgcagatggc 360 

agtgtgaaga gagaagacat attctacact tcaaagcttt ggagcaattc ccatcgacca 420 

gagttggtcc gaccagcctt ggaaaggtca ctgaaaaatc ttcaattgga ctatgttgac 4 80 

ctctatctta ttcattttcc agtgtctgta aagccaggtg aggaagtgat ccca'aaagat 540 

gaaaatggaa aaatactatt tgacacagtg gatctctgtg ccacatggga ggccatggag 600 

aagtgtaaag atgcaggatt ggccaagtcc atcggggtgt ccaacttcaa ccacaggctg 660 

ctggagatga tcctcaacaa gccagggctc aagtacaagc ctgtctgcaa ccaggtggaa 720 

tgtcatcctt acttcaacca gagaaaactg ctggatttct gcaagtcaaa agacattgtt 780 

ctggttgcct atagtgctct gggatcccat cgagaagaac catgggtgga cccgaactcc 840 

ccggtgctct tggaggaccc agtcctttgt gccttggcaa aaaagcacaa gcgaacccca 900 
gccctgattg ccctgcgcta ccagctgcag cgtggggttg tggtcctggc caagagctac , 960 

aatgagcagc gcatcagaca gaacgtgcag gtgtttgaat tccagttgac ttcagaggag 1020 

atgaaagcca tagatggcct aaacagaaat gtgcgatatt tgacccttga tatttttgct 1080 

ggccccccta attatccatt ttctgatgaa tattaatga 1119 

<210> 435 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 435 

ggatccgccg ccaccatgac atccattcga gctgta 36 

<210> 436 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 436 

gtcgactcag ctggaccaca gccgcag 27 

<210> 437 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 437 

ggatccgccg ccaccatgga ctcctggacc ttctgct 37 



<210> 438 
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<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 438 

gtcgactcag aaatcctttc tcttgac 27 

<210> 439 
<211> 933 
<212> DNA 

<213> Homo sapiens 
<400> 439 

atggactcct ggaccttctg ctgtgtgtcc ctttgcatcc tggtagcaaa gcacacagat 60 

gctggagtta tccagtcacc ccggcacgag gtgacagaga tgggacaaga agtgactctg 120 

agatgtaaac caatttcagg acacgactac cttttctggt acagacagac catgatgcgg 180 

ggactggagt tgctcattta ctttaacaac aacgttccga tagatgattc agggatgccc 240 

gaggatcgat tctcagctaa gatgcctaat gcatcattct ccactctgaa gatccagccc 300 

tcagaaccca gggactcagc tgtgtacttc tgtgccagca gtttagttgg agcaaacact 3 60 

gaagctttct ttggacaagg caccagactc acagttgtag aggacctgaa caaggtgttc 420 

ccacccgagg tcgctgtgtt tgagccatca gaagcagaga tctcccacac ccaaaaggcc 480 

acactggtgt gcctggccac aggcttcttc cctgaccacg tggagctgag ctggtgggtg 540 

aatgggaagg aggtgcacag tggggtcagc acggacccgc agcccctcaa ggagcagccc 600 

gccctcaatg actccagata ctgcctgagc agccgcctga gggtctcggc caccttctgg 660 

cagaaccccc gcaaccactt ccgctgtcaa gtccagttct acgggctctc ggagaatgac 720 

gagtggaccc aggatagggc caaacccgtc acccagatcg tcagcgccga ggcctggggt 780 

agagcagact gtggctttac ctcggtgtcc taccagcaag gggtcctgtc tgccaccatc 840 

ctctatgaga tcctgctagg gaaggccacc ctgtatgctg tgctggtcag cgcccttgtg 900 

ttgatggcca tggtcaagag aaaggatttc tga 933 

<210> 440 
<211> 822 
<212> DNA 

<213> Homo sapiens 
<400> 440 

atgacatcca ttcgagctgt atttatattc ctgtggctgc agctggactt ggtgaatgga 60 

gagaatgtgg agcagcatcc ttcaaccctg agtgtccagg agggagacag cgctgttatc 120 

aagtgtactt attcagacag tgcctcaaac tacttccctt ggtataagca agaacttgga 180 

aaaagacctc agcttattat agacattcgt tcaaatgtgg gcgaaaagaa agaccaacga 240 

attgctgtta cattgaacaa gacagccaaa catttctccc tgcacatcac agagacccaa 300 

cctgaagact cggctgtcta cttctgtgca gcaagtatac tgaacaccgg taaccagttc 3 60 

tattttggga cagggacaag tttgacggtc attccaaata tccagaaccc tgaccctgcc 420 

gtgtaccagc tgagagactc taaatccagt gacaagtctg tctgcctatt caccgatttt 480 

gattctcaaa caaatgtgtc acaaagtaag gattctgatg tgtatatcac agacaaaact 540 

gtgctagaca tgaggtctat ggacttcaag agcaacagtg ctgtggcctg gagcaacaaa 600 

tctgactttg catgtgcaaa cgccttcaac aacagcatta ttccagaaga caccttcttc 660 

cccagcccag aaagttcctg tgatgtcaag ctggtcgaga aaagctttga aacagatacg 720 

aacctaaact ttcaaaacct gtcagtgatt gggttccgaa tcctcctcct gaaagtggcc 7 80 

gggtttaatc tgctcatgac gctgcggctg tggtccagct ga 822 



<210> 441 
<211> 2311 
<212> DNA 
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<213> Homo sapiens 
<400> 441 

gatttaatcc tatgacaaac taagttggtt ctgtcttcac ctgttttggt gaggttgtgt 60 
aagagttggt gtttgctcag gaagagattt aagcatgctt gcttacccag actcagagaa 12 0 
gtctccctgt tctgtcctag ctatgttcct gtgttgtgtg cattcgtctt ttccagagca 180 
aaccgcccag agtagaagat ggattggggc acgctgcaga cgatcctggg gggtgtgaac 240 
aaacactcca ccagcattgg aaagatctgg ctcaccgtcc tcttcatttt tcgcattatg 300 
atcctcgttg tggctgcaaa ggaggtgtgg ggagatgagc aggccgactt tgtctgcaac 360 
accctgcagc caggctgcaa gaacgtgtgc tacgatcact acttccccat ctcccacatc 420 
cggctatggg ccctgcagct gatcttcgtg tccagcccag cgctcctagt ggccatgcac 480 
gtggcctacc ggagacatga gaagaagagg aagttcatca agggggagat aaagagtgaa 54 0 
tttaaggaca tcgaggagat caaaacccag aaggtccgca tcgaaggctc cctgtggtgg 600 
acctacacaa gcagcatctt cttccgggtc atcttcgaag ccgccttcat gtacgtcttc 660 
tatgtcatgt acgacggctt ctccatgcag cggctggtga agtgcaacgc ctggccttgt 72 0 
cccaacactg tggactgctt tgtgtcccgg cccacggaga agactgtctt cacagtgttc 780 
atgattgcag tgtctggaat ttgcatcctg ctgaatgtca ctgaattgtg ttatttgcta 840 
attagatatt gttctgggaa gtcaaaaaag ccagtttaac gcattgccca gttgttagat 900 
taagaaatag acagcatgag agggatgagg caacccgtgc tcagctgtca aggctcagtc 960 
gccagcattt cccaacacaa agattctgac cttaaatgca accatttgaa acccctgtag 1020 
gcctcaggtg aaactccaga tgccacaatg agctctgctc ccctaaagcc tcaaaacaaa 1080 
ggcctaattc tatgcctgtc ttaattttct ttcacttaag ttagttccac tgagacccca 1140 
ggctgttagg ggttattggt gtaaggtact ttcatatttt aaacagagga tatcggcatt 1200 
tgtttctttc tctgaggaca agagaaaaaa gccaggttcc acagaggaca cagagaaggt 12 60 
ttgggtgtcc tcctggggtt ctttttgcca actttcccca cgttaaaggt gaacattggt 1320 
tctttcattt gctttggaag ttttaatctc taacagtgga caaagttacc agtgccttaa 1380 
actctgttac actttttgga agtgaaaact ttgtagtatg ataggttatt ttgatgtaaa 1440 
gatgttctgg ataccattat atgttccccc tgtttcagag gctcagattg taatatgtaa 1500 
atggtatgtc attcgctact atgatttaat ttgaaatatg gtcttttggt tatgaatact 1560 
ttgcagcaca gctgagagag gctgtctgtt gtattcattg tggtcatagc acctaacaac 1620 
attgtagcct caatcgagtg agacagacta gaagttccta gttggcttat gatagcaaat 1680 
ggcctcatgt caaatattag atgtaatttt gtgtaagaaa tacagactgg atgtaccacc 174 0 
aactactacc tgtaatgaca ggcctgtcca acacatctcc cttttccatg ctgtggtagc 18 00 
cagcatcgga aagaacgctg atttaaagag gtgagcttgg gaattttatt gacacagtac 18 60 
catttaatgg ggagacaaaa atgggggcca ggggagggag aagtttctgt cgttaaaaac 1920 
gagtttggaa agactggact ctaaattctg ttgattaaag atgagctttg tctaccttca 1980 
aaagtttgtt tggcttaccc ccttcagcct ccaatttttt aagtgaaaat ataactaata 2040 
acatgtgaaa agaatagaag ctaaggttta gataaatatt gagcagatct ataggaagat 2100 
tgaacctgaa tattgccatt atgcttgaca tggtttccaa aaaatggtac tccacatact 2160 
tcagtgaggg taagtatttt cctgttgtca agaatagcat tgtaaaagca ttttgtaata 2220 
ataaagaata gctttaatga tatgcttgta actaaaataa ttttgtaatg tatcaaatac 22 80 
atttaaaaca ttaaaatata atctctataa t 2311 

<210> 442 
<211> 226 
<212> PRT 

<213> Homo sapiens 
<400> 442 

Met Asp Trp Gly Thr Leu Gin Thr He Leu Gly Gly Val Asn Lys His 
5 10 15 

Ser Thr Ser He Gly Lys He Trp Leu Thr Val Leu Phe He Phe Arg 
20 25 30 



He Met He Leu Val Val Ala Ala Lys Glu Val Trp Gly Asp Glu Gin 
35 40 45 
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Ala Asp Phe Val Cys Asn Thr Leu 

50 55 

Tyr Asp His Tyr Phe Pro lie Ser 
65 70 



Leu lie Phe Val Ser Ser Pro Ala 
85 



Tyr Arg Arg His Glu Lys Lys Arg 
100 



Gin Pro Gly Cys Lys Asn Val Cys 
60 

His lie Arg Leu Trp Ala Leu Gin 

75 80 

Leu Leu Val Ala Met His Val Ala 
90 95 

Lys Phe lie Lys Gly Glu lie Lys 
105 ' 110 



Ser Glu Phe Lys Asp lie Glu Glu He Lys Thr Gin Lys Val Arg He 
115 120 125 



Glu Gly Ser Leu Trp Trp Thr Tyr Thr Ser Ser He Phe Phe Arg Val 
130 135 140 



He Phe Glu Ala Ala Phe Met Tyr Val Phe Tyr Val Met Tyr Asp Gly 
145 150 155 160 



Phe Ser Met Gin Arg Leu Val Lys 
165 

Thr Val Asp Cys Phe Val Ser Arg 
180 

Val Phe Met He Ala Val Ser Gly 
195 200 

Glu Leu Cys Tyr Leu Leu He Arg 
210 215 



Cys Asn Ala Trp Pro Cys Pro Asn 
170 175 

Pro Thr Glu Lys Thr Val Phe Thr 
185 190 

He Cys He Leu Leu Asn Val Thr 
205 

Tyr Cys Ser Gly Lys Ser Lys Lys 
220 



Pro Val 
225 



<210> 443 
<211> 23 
<212> PRT 

<213> Homo sapiens 
<400> 443 

Val Lys Leu Cys Gly He Asp Pro Cys Pro Asn Leu Val Asp Cys Phe 



He Ser Arg Pro Gly Cys Gly 



<210> 444 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
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<400> 444 

caatcaggca tgcacaacaa actgtatatc ggaaac 

<210> 445 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 445 

cgtcaagatc ttcattactt ccgtcttgac 

<210> 446 
<211> 579 
<212> PRT 

<213> Homo sapiens 
<400> 446 

Met Asn Lys Leu Tyr He Gly Asn Leu Ser Glu Asn Ala Ala Pro Ser 
5 10 15 

Asp Leu Glu Ser He Phe Lys Asp Ala Lys He Pro Val Ser Gly Pro 
20 25 30 

Phe Leu Val Lys Thr Gly Tyr Ala Phe Val Asp Cys Pro Asp Glu Ser 
35 40 45 

Trp Ala Leu Lys Ala He Glu Ala Leu Ser Gly Lys He Glu Leu His 
50 55 60 

Gly Lys Pro He Glu Val Glu His Ser Val Pro Lys Arg Gin Arg He 



Arg Lys Leu Gin He Arg Asn He Pro Pro His Leu Gin Trp Glu Val 
85 90 95 

Leu Asp Ser Leu Leu Val Gin Tyr Gly Val Val Glu Ser Cys Glu Gin 
100 105 110 

Val Asn Thr Asp Ser Glu Thr Ala Val Val Asn Val Thr Tyr Ser Ser 
115 120 125 

Lys Asp Gin Ala Arg Gin Ala Leu Asp Lys Leu Asn Gly Phe Gin Leu 
130 135 140 

Glu Asn Phe Thr Leu Lys Val Ala Tyr He Pro Asp Glu Thr Ala Ala 
145 150 155 160 

Gin Gin Asn Pro Leu Gin Gin Pro Arg Gly Arg Arg Gly Leu Gly Gin 
165 170 175 

Arg Gly Ser Ser Arg Gin Gly Ser Pro Gly Ser Val Ser Lys Gin Lys 
180 185 190 
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Pro Cys Asp Leu Pro Leu Arg Leu Leu Val Pro Thr Gin Phe Val Gly 
195 200 205 

Ala He He Gly Lys Glu Gly Ala Thr He Arg Asn He Thr Lys Gin 
210 215 220 

Thr Gin Ser Lys He Asp Val His Arg Lys Glu Asn Ala Gly Ala Ala 
225 230 235 240 

Glu Lys Ser He Thr He Leu Ser Thr Pro Glu Gly Thr Ser Ala Ala 
245 250 255 

Cys Lys Ser He Leu Glu He Met His Lys Glu Ala Gin Asp He Lys 
260 265 270 

Phe Thr Glu Glu He Pro Leu Lys He Leu Ala His Asn Asn Phe Val 
275 280 285 

Gly Arg Leu He Gly Lys Glu Gly Arg Asn Leu Lys Lys lie Glu Gin 
290 295 300 

Asp Thr Asp Thr Lys lie Thr He Ser Pro Leu Gin Glu Leu Thr Leu 
305 310 315 320 

Tyr Asn Pro Glu Arg Thr He Thr Val Lys Gly Asn Val Glu Thr Cys 
325 330 . 335 

Ala Lys Ala Glu Glu Glu He Met Lys Lys He Arg Glu Ser Tyr Glu 
340 345 350 

Asn Asp lie Ala Ser Met Asn Leu Gin Ala His Leu He Pro Gly Leu 
355 360 365 

Asn Leu Asn Ala Leu Gly Leu Phe Pro Pro Thr Ser Gly Met Pro Pro 
370 375 380 

Pro Thr Ser Gly Pro Pro Ser Ala Met Thr Pro Pro Tyr Pro Gin Phe 
385 390 395 400 

Glu Gin Ser Glu Thr Glu Thr Val His Leu Phe He Pro Ala Leu Ser 
405 410 415 

Val Gly Ala He He Gly Lys Gin Gly Gin His He Lys Gin Leu Ser 
420 425 430 

Arg Phe Ala Gly Ala Ser He Lys He Ala Pro Ala Glu Ala Pro Asp 
435 440 445 

Ala Lys Val Arg Met Val He He Thr Gly Pro Pro Glu Ala Gin Phe 
450 455 460 

Lys Ala Gin Gly Arg He Tyr Gly Lys He Lys Glu Glu Asn Phe Val 
465 470 475 480 

Ser Pro Lys Glu Glu Val Lys Leu Glu Ala His He Arg Val Pro Ser 
485 490 495 



Phe Ala Ala Gly Arg Val He Gly Lys Gly Gly Lys Thr Val Asn Glu 
500 505 510 
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Leu Gin Asn Leu Ser Ser Ala Glu 
515 520 

Pro Asp Glu Asn Asp Gin Val Val 
530 535 

Ala Cys Gin Val Ala Gin Arg Lys 
545 550 

Lys Gin His Gin Gin Gin Lys Ala 
565 

Arg Arg Lys 



Val Val Val Pro Arg Asp Gin Thr 
525 

Val Lys He Thr Gly His Phe Tyr 
540 

He Gin Glu He Leu Thr Gin Val 
555 560 

Leu Gin Ser Gly Pro Pro Gin Ser 
570 575 



<210> 447 

<211> 1743 

<212> DNA 

<213> Homo sapiens 

<400> 447 

atgaacaaac tgtatatcgg aaacctcagc 
atcttcaagg acgccaagat cccggtgtcg 
ttcgtggact gcccggacga gagctgggcc 
atagaactgc acgggaaacc catagaagtt 
cggaaacttc agatacgaaa tatcccgcct 
ctagtccagt atggagtggt ggagagctgt 
gttgtaaatg taacctattc cagtaaggac 
ggatttcagt tagagaattt caccttgaaa 
cagcaaaacc ccttgcagca gccccgaggt 
aggcaggggt ctccaggatc cgtatccaag 
ctggttccca cccaatttgt tggagccatc 
atcaccaaac agacccagtc taaaatcgat 
gagaagtcga ttactatcct ctctactcct 
ctggagatta tgcataagga agctcaagat 
attttagctc ataataactt tgttggacgt 
aaaattgagc aagacacaga cactaaaatc 
tataatccag aacgcactat tacagttaaa 
gaggagatca tgaagaaaat cagggagtct 
caagcacatt taattcctgg attaaatctg 
gggatgccac ctcccacctc agggccccct 
gagcaatcag aaacggagac tgttcatctg 
atcggcaagc agggccagca catcaagcag 
attgctccag cggaagcacc agatgctaaa 
gaggctcagt tcaaggctca gggaagaatt 
agtcctaaag aagaggtgaa acttgaagct 
agagttattg gaaaaggagg caaaacggtg 
gttgttgtcc ctcgtgacca gacacctgat 
ggtcacttct atgcttgcca ggttgcccag 
aagcagcacc aacaacagaa ggctctgcaa 
tga 



gagaacgccg ccccctcgga cctagaaagt 60 
ggacccttcc tggtgaagac tggctacgcg 120 
ctcaaggcca tcgaggcgct ttcaggtaaa 180 
gagcactcgg tcccaaaaag gcaaaggatt 240 
catttacagt gggaggtgct ggatagttta 300 
gagcaagtga acactgactc ggaaactgca 360 
caagctagac aagcactaga caaactgaat 420 
gtagcctata tccctgatga aacggccgcc 480 
cgccgggggc ttgggcagag gggctcctca 540 
cagaaaccat gtgatttgcc tctgcgcctg 600 
ataggaaaag aaggtgccac cattcggaac 660 
gtccaccgta aagaaaatgc gggggctgct 720 
gaaggcacct ctgcggcttg taagtctatt 780 
ataaaattca cagaagagat ccccttgaag 840 
cttattggta aagaaggaag aaatcttaaa 900 
acgatatctc cattgcagga attgacgctg 960 
ggcaatgttg agacatgtgc caaagctgag 1020 
tatgaaaatg atattgcttc tatgaatctt 1080 
aacgccttgg gtctgttccc acccacttca 1140 
tcagccatga ctcctcccta cccgcagttt 1200 
tttatcccag ctctatcagt cggtgccatc 12 60 
ctttctcgct ttgctggagc ttcaattaag 1320 
gtgaggatgg tgattatcac tggaccacca 13 80 
tatggaaaaa ttaaagaaga aaactttgtt 1440 
catatcagag tgccatcctt tgctgctggc 1500 
aatgaacttc agaatttgtc aagtgcagaa 1560 
gagaatgacc aagtggttgt caaaataact 162 0 
agaaaaattc aggaaattct gactcaggta 1680 
agtggaccac ctcagtcaag acggaagtaa 17 4 0 
1743 



<210> 448 
<211> 35 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> PGR primer 
<400> 448 

cgtactagca tatgaacaaa ctgtatatcg gaaac 35 



<210> 449 
<211> 579 
<212> PRT 

<213> Homo sapiens 
<400> 449 



Met 


Asn 


Lys 


Leu 


Tyr 
5 


He 


Gly 


Asn 


Leu 


Ser 
10 


Glu Asn Ala Ala Pro Ser 
15 


Asp 


Leu 


Glu 


Ser 
20 


He 


Phe 


Lys 


Asp 


Ala 
25 


Lys 


He Pro Val Ser Gly Pro 
30 


Phe 


Leu 


Val 
35 


Lys 


Thr 


Gly 


Tyr 


Ala 
40 


Phe 


Val 


Asp Cys Pro Asp Glu Ser 
45 


Trp 


Ala 
50 


Leu 


Lys 


Ala 


He 


Glu 
55 


Ala 


Leu 


Ser 


Gly Lys He Glu Leu His 
60 


Gly 
65 


Lys 


Pro 


lie 


Glu 


Val 
70 


Glu 


His 


Ser 


Val 


Pro Lys Arg Gin Arg He 
75 80 


Arg 


Lys 


Leu 


Gin 


He 
85 


Arg 


Asn 


He 


Pro 


Pro 
90 


His Leu Gin Trp Glu Val 
95 


Leu 


Asp 


Ser 


Leu 
100 


Leu 


Val 


Gin 


Tyr 


Gly 
105 


Val 


Val Glu Ser Cys Glu Gin 
110 


Val 


Asn 


Thr 
115 


Asp 


Ser 


Glu 


Thr 


Ala 
120 


Val 


Val 


Asn Val Thr Tyr Ser Ser 
125 


Lys 


Asp 
130 


Gin 


Ala 


Arg 


Gin 


Ala 
135 


Leu 


Asp 


Lys 


Leu Asn Gly Phe Gin Leu 
140 


Glu 

145 


Asn 


Phe 


Thr 


Leu 


Lys 
150 


Val 


Ala 


Tyr 


He 


Pro Asp Glu Thr Ala Ala 
155 160 


Gin 


Gin 


Asn 


Pro 


Leu 
165 


Gin 


Gin 


Pro 


Arg 


Gly Arg Arg Gly Leu Gly Gin 
170 175 


Arg 


Gly 


Ser 


Ser 
180 


Arg 


Gin 


Gly 


Ser 


Pro 
185 


Gly 


Ser Val Ser Lys Gin Lys 
190 


Pro 


Cys 


Asp 
195 


Leu 


Pro 


Leu 


Arg 


Leu 
200 


Leu 


Val 


Pro Thr Gin Phe Val Gly 
205 


Ala 

■ Thr 
225 


He 
210 
Gin 


He 
Ser 


Gly 
Lys 


Lys 
He 


Glu 

Asp 
230 


Gly 
215 
Val 


Ala 
His 


Thr 
Arg 


He 

Lys 


Arg Asn He Thr Lys Gin 
220 

Glu Asn Ala Gly Ala Ala 
235 240 
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Glu Lys Ser He Thr He Leu Ser Thr Pro Glu Gly Thr Ser Ala Ala 
245 250 255 

Cys Lys Ser He Leu Glu He Met His Lys Glu Ala Gin Asp He Lys 
260 265 270 



Phe Thr Glu Glu He Pro Leu Lys 
275 280 

Gly Arg Leu He Gly Lys Glu Gly 
290 295 

Asp Thr Asp Thr Lys He Thr He 
305 310 

Tyr Asn Pro Glu Arg Thr He Thr 
325 

Ala Lys Ala Glu Glu Glu He Met 
340 



He Leu Ala His Asn Asn Phe Val 
285 



Arg Asn Leu Lys Lys He Glu Gin 
300 



Ser Pro Leu Gin Glu Leu Thr Leu 
315 320 



Val Lys Gly Asn Val Glu Thr Cys 
330 335 



Lys Lys He Arg Glu Ser Tyr Glu 
345 350 



Asn Asp He Ala Ser Met Asn Leu Gin Ala His Leu He Pro Gly Leu 
355 360 365 

Asn Leu Asn Ala Leu Gly Leu Phe Pro Pro Thr Ser Gly Met Pro Pro 
370 375 380 

Pro Thr Ser Gly Pro Pro Ser Ala Met Thr Pro Pro Tyr Pro Gin Phe 
385 " 390 395 400 

Glu Gin Ser Glu Thr Glu Thr Val His Leu Phe lie Pro Ala Leu Ser 
405 410 415 

Val Gly Ala He He Gly Lys Gin Gly Gin His He Lys Gin Leu Ser 
420 425 430 

Arg Phe Ala Gly Ala Ser He Lys He Ala Pro Ala Glu Ala Pro Asp 
435 440 445 

Ala Lys Val Arg Met Val He He Thr Gly Pro Pro Glu Ala Gin Phe 

450 455 460 

Lys Ala Gin Gly Arg He Tyr Gly Lys He Lys Glu Glu Asn Phe Val 

465 470 475 480 

Ser Pro Lys Glu Glu Val Lys Leu Glu Ala His He Arg Val Pro Ser 
485 490 495 

Phe Ala Ala Gly Arg Val He Gly Lys Gly Gly Lys Thr Val Asn Glu 
500 505 510 

Leu Gin Asn Leu Ser Ser Ala Glu Val Val Val Pro Arg Asp Gin Thr 
515 520 525 

Pro Asp Glu Asn Asp Gin Val Val Val Lys He Thr Gly His Phe Tyr 
530 535 540 

Ala Cys Gin Val Ala Gin Arg Lys He Gin Glu He Leu Thr Gin Val 
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Lys Gin His Gin Gin Gin Lys Ala Leu Gin Ser Gly Pro Pro Gin Ser 
565 570 575 



<210> 450 

<211> 1743 

<212> DNA 

<213> Homo sapiens 

<400> 450 

atgaacaaac tgtatatcgg aaacctcagc gagaacgccg ccccctcgga cctagaaagt 60 
atcttcaagg acgccaagat cccggtgtcg ggacccttcc tggtgaagac tggctacgcg 120 
ttcgtggact gcccggacga gagctgggcc ctcaaggcca tcgaggcgct ttcaggtaaa 180 
atagaactgc acgggaaacc catagaagtt gagcactcgg tcccaaaaag gcaaaggatt 240 
cggaaacttc agatacgaaa tatcccgcct catttacagt gggaggtgct ggatagttta 300 
ctagtccagt atggagtggt ggagagctgt gagcaagtga acactgactc ggaaactgca 360 
gttgtaaatg taacctattc cagtaaggac caagctagac aagcactaga caaactgaat 42 0 
ggatttcagt tagagaattt caccttgaaa gtagcctata tccctgatga aacggccgcc 480 
cagcaaaacc ccttgcagca gccccgaggt cgccgggggc ttgggcagag gggctcctca 54 0 
aggcaggggt ctccaggatc cgtatccaag cagaaaccat gtgatttgcc tctgcgcctg 600 
ctggttccca cccaatttgt tggagccatc ataggaaaag aaggtgccac cattcggaac 660 
atcaccaaac agacccagtc taaaatcgat gtccaccgta aagaaaatgc gggggctgct 720 
gagaagtcga ttactatcct ctctactcct gaaggcacct ctgcggcttg taagtctatt 780 
ctggagatta tgcataagga agctcaagat ataaaattca cagaagagat ccccttgaag 840 
attttagctc ataataactt tgttggacgt cttattggta aagaaggaag aaatcttaaa 900 
aaaattgagc aagacacaga cactaaaatc acgatatctc cattgcagga attgacgctg 960 
tataatccag aacgcactat tacagttaaa ggcaatgttg agacatgtgc caaagctgag 1020 
gaggagatca tgaagaaaat cagggagtct tatgaaaatg atattgcttc tatgaatctt 1080 
caagcacatt taattcctgg attaaatctg aacgccttgg gtctgttccc acccacttca 1140 
gggatgccac ctcccacctc agggccccct tcagccatga ctcctcccta cccgcagttt 12 00 
gagcaatcag aaacggagac tgttcatctg tttatcccag ctctatcagt cggtgccatc 1260 
atcggcaagc agggccagca catcaagcag ctttctcgct ttgctggagc ttcaattaag 132 0 
attgctccag cggaagcacc agatgctaaa gtgaggatgg tgattatcac tggaccacca 1380 
gaggctcagt tcaaggctca gggaagaatt tatggaaaaa ttaaagaaga aaactttgtt 1440 
agtcctaaag aagaggtgaa acttgaagct catatcagag tgccatcctt tgctgctggc 1500 
agagttattg gaaaaggagg caaaacggtg aatgaacttc agaatttgtc aagtgcagaa 1560 
gttgttgtcc ctcgtgacca gacacctgat gagaatgacc aagtggttgt caaaataact 1620 
ggtcacttct atgcttgcca ggttgcccag agaaaaattc aggaaattct gactcaggta 1680 
aagcagcacc aacaacagaa ggctctgcaa agtggaccac ctcagtcaag acggaagtaa 1740 
tga ' 1743 



<210> 451 
<211> 25 
<212> PRT 

<213> Homo sapiens 
<400> 451 

Leu Gly Lys Glu Val Arg Asp Ala Lys He Thr Pro Glu Ala Phe Glu 



Lys Leu Gly Phe Pro Ala Ala Lys Glu 
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<210> 452 
<211> 25 
<212> PRT 

<213> Homo sapiens 
<400> 452 

Lys Ala Ser Asp Gly Asp Tyr Tyr Thr Leu Ala Val Pro Met Gly Asp 



Val Pro Met Asp Gly He Ser Val Ala 



<210> 453 
<211> 16 
<212> PRT 

<213> Homo sapiens 
<400> 453 

Pro Asp Arg Asp Val Asn Leu Thr His Gin Leu Asn Pro Lys Val Lys 



<210> 454 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 454 

Lys He Ala Pro Ala Glu Ala Pro Asp Ala Lys Val Arg Met Val He 



He Thr Gly Pro 



<210> 455 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 455 

Pro Asp Glu Thr Ala Ala Gin Gin Asn Pro Leu Gin Gin Pro Arg Gly 



Arg Arg Gly Leu 



<210> 456 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 456 

Arg Thr He Thr Val Lys Gly Asn Val Glu Thr Cys Ala Lys Ala Glu 
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Glu Glu He Met 
20 

<210> 457 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 457 

Ala Phe Val Asp Cys Pro Asp Glu Ser Trp Ala Leu Lys Ala He Glu 



Ala Leu Ser Gly 



<210> 458 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 458 

He Arg Lys Leu Gin He Arg Asn He Pro Pro His Leu Gin Trp Glu 



Val Leu Asp Ser 
20 



<210> 459 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 459 

Ala Gin Gin Asn Pro Leu Gin Gin Pro Arg Gly Arg Arg Gly Leu Gly 



Gin Arg Gly Ser 



<210> 460 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 460 

Asp Val His Arg Lys Glu Asn Ala Gly Ala Ala Glu Lys Ser He Thr 



He Leu Ser Thr 
20 



<210> 461 
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<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 461 

Leu Tyr Asn Pro Glu Arg Thr He 
5 

Cys Ala Lys Ala 
20 



Thr Val Lys Gly Asn Val Glu Thr 
10 15 



<210> 462 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 462 

Glu Glu Glu He Met Lys Lys He Arg Glu Ser Tyr Glu Asn Asp He 



Ala Ser Met Asn 



<210> 463 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 463 

Leu Asn Ala Leu Gly Leu Phe Pro Pro Thr Ser Gly Met Pro Pro Pro 



Thr Ser Gly Pro 



<210> 464 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 464 

Lys He Ala Pro Ala Glu Ala Pro Asp Ala Lys Val Arg Met Val He 



He Thr Gly Pro 

• 20 

<210> 465 
<211> 18 
<212> PRT 

<213> Homo sapiens 
<400> 465 

Thr Gly Tyr Ala Phe Val Asp Cys Pro Asp Glu Ser Trp Ala Leu Lys He 
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Glu 



<210> 466 
<211> 11 
<212> PRT 

<213> Homo sapiens 
<400> 466 

Phe Val Asp Cys Pro Asp Glu Ser Trp Ala Leu 



<210> 467 

<211> 33 

<212> DNA 

<213> Homo sapiens 

<400> 467 

ttcgtggact gcccggacga gagctgggcc etc 



<210> 468 
<211> 24 
<212> PRT 

<213> Homo sapiens 
<400> 468 

lie Pro Asp Glu Met Ala Ala Gin Gin Asn Pro Leu Gin Gin Pro Arg 



Gly Arg Arg Gly Leu Gly Gin Arg 



<210> 469 
<211> 24 
<212> PRT 

<213> Homo sapiens 
<400> 469 

lie Pro Asp Glu Thr Ala Ala Gin Gin Asn Pro Ser Pro Gin Leu Arg 
5 10 15 



Gly Arg Arg Gly Pro Gly Gin Arg 
20 



